METASTATIC BREAST AND COLON CANCER REGULATED GENES

TECHNICAL FIELD OF THE INVENTION 

This invention relates to methods for predicting the behavior of tumors.
More particularly, the invention relates to methods in which a tumor sample is
examined for expression of a specified gene sequence thereby to indicate propensity for
metastatic spread.

BACKGROUND OF THE INVENTION 

Breast cancer is one of the most common malignant diseases in women,
with about 1,000,000 new cases per year worldwide. Colon cancer is another of the
most common cancers. Despite use of a number of histochemical, genetic, and
immunological markers, clinicians still have a difficult time predicting which tumors
will metastasize to other organs. Some patients are in need of adjuvant therapy to
prevent recurrence and metastasis and others are not. However, distinguishing between
these subpopulations of patients is not straightforward, and course of treatment is not
easily charted. There is a need in the art for new markers for distinguishing between
tumors which will or have metastasized and those which are less likely to metastasize.

SUMMARY OF THE INVENTION 
It is an object of the present invention to provide markers for
distinguishing between tumors which will or have metastasized and those which are less
likely to metastasize. These and other objects of the invention are provided by one or
more of the embodiments described below.

One embodiment of the invention provides an isolated and purified
human protein having an amino acid sequence which is at least 85% identical to an
amino acid sequence encoded by a nucleotide sequence selected from the group
consisting of SEQ ID NOS:1-63 or the complement thereof.

Another embodiment of the invention provides a fusion protein which
comprises a first protein segment and a second protein segment fused to each other by
means of a peptide bond. The first protein segment consists of at least six contiguous
amino acids selected from an amino acid sequence encoded by a nucleotide sequence
selected from the group consisting of SEQ ID NOS:1-63 or the complement thereof.

WO 00/22130    PCT/US99/24222

Yet another embodiment of the invention provides an isolated and
purified polypeptide consisting of at least six contiguous amino acids of a human
protein having an amino acid sequence encoded by a nucleotide sequence selected from
the group consisting of SEQ ID NOS:1-63 or the complement thereof.

Still another embodiment of the invention provides a preparation of
antibodies which specifically bind to a human protein which comprises an amino acid
sequence encoded by a nucleotide sequence selected from the group consisting of SEQ
ID NOS:1-63 or the complement thereof.

Even another embodiment of the invention provides an isolated and
purified subgenomic polynucleotide comprising at least 11 contiguous nucleotides of a
nucleotide sequence which is at least 96% identical to a nucleotide sequence selected
from the group consisting of SEQ ID NOS:1-63 or the complement thereof.

Another embodiment of the invention provides an isolated and purified 
gene which comprises a coding sequence comprising a nucleotide sequence selected 
from the group consisting o^ 

Yet another embodiment of the invention provides a method for
determining metastasis in a tissue sample. An expression product of a gene which
comprises a coding sequence selected from the group consisting of SEQ ID NOS:1, 2,
4, 5, 9, 11, 13, 14, 18, 19, 20, 22, 24, 26, 29, 30, 33, 35, 36, 38-41, 45, 48, 52, 55, 57,
58, 60, 63-66, 69-74, 76, 80, 82, and 83 is measured in a tissue sample. A tissue sample
which expresses the product is categorized as metastatic.

Still another embodiment of the invention provides a method for
determining metastasis in a tissue sample. An expression product of a gene which
comprises a sequence selected from the group consisting of SEQ ID NOS:3, 7, 8, 10,
12, 15-17, 21, 23, 28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61, 62, 67, 68, 75, 77-79,
81, 84, and 85 is measured in a tissue sample. A tissue sample which does not express
the product is categorized as metastatic.

Even another embodiment of the invention provides a method for
determining metastatic potential in a tissue sample. An expression product of a gene
which comprises a sequence selected from the group consisting of SEQ ID NOS:1, 2,
4, 5, 9, 11, 13, 14, 18, 19, 20, 22, 24, 26, 29, 30, 33, 35, 36, 38-41, 45, 48, 52, 55, 57,
58, 60, 63-66, 69-74, 76, 80, 82, and 83 is measured in a tissue sample. A tissue sample
which expresses the product is categorized as having metastatic potential.

A further embodiment of the invention provides a method for
determining metastatic potential in a tissue sample. An expression product of a gene
which comprises a sequence selected from the group consisting of SEQ ID NOS:3, 7,
8, 10, 12, 15-17, 21, 23, 28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61, 62, 67, 68, 75,
77-79, 81, 84, and 85 is measured in a tissue sample. A tissue sample which does not
express the product is categorized as having metastatic potential.

Another embodiment of the invention provides a method of predicting
the propensity for metastatic spread of a breast tumor preferentially to bone or lung. An
expression product of a gene which comprises a sequence selected from the group
consisting of SEQ ID NOS:1, 5, 11, 18, 20, 22, 24, 30, 33, 35, 36, 38, 45, 52, 58, 65, 66,
70, 74, 76, and 80 is measured in a breast tumor sample. A breast tumor sample which
expresses the product is categorized as having a propensity to metastasize to bone or
lung.

Even another embodiment of the invention provides a method of
predicting propensity for metastatic spread of a breast tumor preferentially to lung. An
expression product of a gene which comprises a sequence selected from the group
consisting of SEQ ID NOS:2, 4, 9, 13, 14, 19, 26, 29, 39-41, 48, 55, 57, 60, 63, 64, 72,
73, 82, and 83 is measured in a breast tumor sample. A breast tumor sample which
expresses the product is characterized as having a propensity to metastasize to lung.

Still another embodiment of the invention provides a method of
predicting propensity for metastatic spread of a colon tumor. An expression product of
a gene which comprises the nucleotide sequence shown in SEQ ID NO:56 is measured
in a colon tumor sample. A colon tumor sample which expresses the product is
characterized as having a low propensity to metastasize.
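
The categorization rules of the three preceding embodiments can be sketched as a simple lookup. This is a minimal illustration only and not part of the disclosed methods: the function name `categorize`, the set names, and the representation of a sample as a set of detected SEQ ID NOs are assumptions made for the example, and the SEQ ID NO sets are transcribed from the lists recited above.

```python
# SEQ ID NO groupings transcribed from the embodiments above.
# All identifiers here are hypothetical and used for illustration only.
BONE_OR_LUNG = {1, 5, 11, 18, 20, 22, 24, 30, 33, 35, 36, 38, 45, 52, 58,
                65, 66, 70, 74, 76, 80}
LUNG = {2, 4, 9, 13, 14, 19, 26, 29, 39, 40, 41, 48, 55, 57, 60, 63, 64,
        72, 73, 82, 83}
COLON_LOW = {56}


def categorize(tumor_type, expressed):
    """Return propensity labels for a tumor sample.

    tumor_type: "breast" or "colon".
    expressed:  iterable of SEQ ID NOs whose expression product was detected.
    """
    expressed = set(expressed)
    labels = []
    if tumor_type == "breast":
        if expressed & BONE_OR_LUNG:
            labels.append("propensity to metastasize to bone or lung")
        if expressed & LUNG:
            labels.append("propensity to metastasize to lung")
    elif tumor_type == "colon":
        if expressed & COLON_LOW:
            labels.append("low propensity to metastasize")
    return labels


print(categorize("breast", {5, 19}))
print(categorize("colon", {56}))
```

How expression is measured and what signal level counts as "expresses the product" are left outside the sketch; they would depend on the assay used.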

Even another embodiment of the invention provides a method for
determining metastasis in a tissue sample. An expression product of a gene which
comprises a coding sequence selected from the group consisting of SEQ ID NOS:3, 7,
8, 10, 12, 15-17, 21, 23, 25, 28, 31, 34, 37, 42-44, 46, 47, 49, 61, 62, 67, 68, 75, 77-79,
81, 84, and 85 is measured in a tissue sample. A tissue sample which expresses the
product is categorized as non-metastatic.

Yet another embodiment of the invention provides a method for
determining metastasis in a tissue sample. An expression product of a gene which
comprises a coding sequence selected from the group consisting of SEQ ID NOS:3, 7,
8, 10, 12, 15-17, 21, 23, 25, 28, 31, 34, 37, 42-44, 46, 47, 49, 61, 62, 67, 68, 75, 77-79,
81, 84, and 85 is measured in a tissue sample. A tissue sample which does not express
the product is categorized as metastatic.

The invention thus provides the art with a number of genes and proteins
which can be used as markers of metastasis. These are useful for more rationally
prescribing the course of therapy for breast or colon cancer patients.

DETAILED DESCRIPTION 

It is a discovery of the present invention that a number of genes are
differentially expressed between metastatic cancer cells, especially cancer cells of the
breast and colon, and non-metastatic cancer cells. These genes are metastatic marker
genes. This information can be utilized to make diagnostic reagents specific for the
expression products of the differentially expressed genes. It can also be used in
diagnostic and prognostic methods which will help clinicians in planning appropriate
treatment regimes for cancers, especially of the breast or colon.

Some of the polynucleotides disclosed herein represent novel genes
which are differentially expressed in cancer cells
which have a potential to metastasize. SEQ ID NOS:1-63 represent novel metastatic
marker genes (Table 1). SEQ ID NOS:64-85 represent known genes which have been
found to be differentially expressed in metastatic relative to non-metastatic cancer cells
(Table 2). Some of the metastatic marker genes disclosed herein are expressed in
metastatic cells relative to non-metastatic cells, particularly in breast cancer cells which
metastasize to bone and lung (SEQ ID NOS:1, 5, 11, 18, 20, 22, 24, 30, 33, 35, 36, 38,
45, 52, 58, 65, 66, 70, 74, 76, and 80). One metastatic marker gene (SEQ ID NO:56) is
expressed in non-metastatic breast cancer cells and in colon cancer cells with low
metastatic potential. Other metastatic marker genes are expressed in metastatic cancer
cells, particularly in breast cancer cells which metastasize only to lung (SEQ ID NOS:2,
4, 9, 13, 14, 19, 26, 29, 39-41, 48, 55, 57, 60, 63, 64, 72, 73, 82, and 83). Still other
metastatic marker genes (SEQ ID NOS:3, 7, 8, 10, 12, 15-17, 21, 23, 25, 28, 31, 34, 37,
42-44, 46, 47, 49, 61, 62, 67, 68, 75, 77-79, 81, 84, and 85) are expressed in cancer cells
which do not typically metastasize, particularly in breast cancer cells. Identification of
these relationships and markers permits the formulation of reagents and methods as
further described below. Genes which comprise
a nucleotide sequence shown in SEQ ID NOS:6, 27, 32, and 54 can be used to identify
cancerous tissue.

Sequences of metastatic marker genes are disclosed in SEQ ID NOS:1-85.
Metastatic marker proteins can be made by expression of the disclosed
polynucleotide molecules. Amino acid sequences encoded by novel polynucleotides of
the invention can be predicted by running a translation program for each of three
reading frames for a disclosed sequence and its complement. Complete polynucleotide
sequences can be obtained by chromosome walking, screening of libraries for
overlapping clones, or other techniques well known in the art.
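
The translation step described above can be sketched as follows. This is a minimal illustration assuming the standard genetic code and a DNA sequence written with the letters A, C, G, and T; the function names are hypothetical and no particular translation program is implied. Stop codons are shown as `*` and codons containing other characters as `X`.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary strand, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq, frame):
    """Translate seq starting at offset frame (0, 1, or 2)."""
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translation(seq):
    """Translate three reading frames of a sequence and of its complement."""
    frames = {}
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            frames[f"{strand}{frame + 1}"] = translate(s, frame)
    return frames


for name, protein in six_frame_translation("ATGGCCTAA").items():
    print(name, protein)
```

Choosing among the six candidate translations (for example, by the longest open reading frame) is a separate decision that the sketch does not make.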

Reference to metastatic marker nucleotide or amino acid sequences
includes variants which have similar expression patterns in metastatic relative to
non-metastatic cells, as described below. Metastatic marker polypeptides can differ in
length from full-length metastatic marker proteins and contain at least 6, 10, 12, 15,
18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 120, 140, 160, 180,
or 200 or more contiguous amino acids of a metastatic marker protein.
Variants of marker proteins and polypeptides can also occur. Metastatic
marker protein or polypeptide variants can be naturally or non-naturally occurring.
Naturally occurring metastatic marker protein or polypeptide variants are found in
humans or other species and comprise amino acid sequences which are substantially
identical to the proteins encoded by genes corresponding to the nucleotide sequences
shown in SEQ ID NOS:1-85 or their complements. Non-naturally occurring metastatic
marker protein or polypeptide variants which retain substantially the same differential
expression patterns in metastatic relative to non-metastatic cancer cells as naturally
occurring metastatic marker protein or polypeptide variants are also included here.
Preferably, naturally or non-naturally occurring metastatic marker protein or
polypeptide variants have amino acid sequences which are at least 85%, 90%, or 95%
identical to amino acid sequences encoded by the nucleotide sequences shown in SEQ
ID NOS:1-85. More preferably, the molecules are at least 98% or 99% identical.
Percent sequence identity between a wild-type protein or polypeptide and a variant is
determined by aligning the wild-type protein or polypeptide with the variant to obtain
the greatest number of amino acid matches, as is known in the art, counting the number
of amino acid matches between the wild-type and the variant, and dividing the total
number of matches by the total number of amino acid residues of the wild-type
sequence.
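
The percent identity calculation just described can be sketched as follows, assuming the two sequences have already been aligned and that gaps are written as `-`. The function name and the gap convention are assumptions made for the example; the alignment step itself is not shown.

```python
def percent_identity(aligned_wild_type, aligned_variant, gap="-"):
    """Percent identity as matches divided by wild-type length.

    Both arguments are rows of one pairwise alignment and must therefore
    have the same length. Gap characters never count as matches, and the
    denominator is the number of wild-type residues (gaps excluded).
    """
    if len(aligned_wild_type) != len(aligned_variant):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(
        1
        for w, v in zip(aligned_wild_type, aligned_variant)
        if w == v and w != gap
    )
    wild_type_length = sum(1 for w in aligned_wild_type if w != gap)
    if wild_type_length == 0:
        raise ValueError("wild-type sequence is empty")
    return 100.0 * matches / wild_type_length


# One substitution in a ten-residue wild-type sequence.
print(percent_identity("MKTAYIAKQR", "MKTAYLAKQR"))
```

Because the denominator is the wild-type length, an insertion in the variant does not by itself lower the figure, whereas a deletion or substitution does.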

Preferably, amino acid changes in metastatic marker protein or
polypeptide variants are conservative amino acid changes, i.e., substitutions of similarly
charged or uncharged amino acids. A conservative amino acid change involves
substitution of one of a family of amino acids which are related in their side chains.
Naturally occurring amino acids are generally divided into four families: acidic
(aspartate, glutamate), basic (lysine, arginine, histidine), non-polar (alanine, valine,
leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), and uncharged
polar (glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine) amino acids.
Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic
amino acids.
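
The family-based notion of a conservative change can be sketched as a lookup on one-letter amino acid codes. This is an illustration only: the grouping follows the four conventional families named above, the placement of proline, phenylalanine, glutamine, and cysteine is an assumption based on that conventional grouping, and the function names are hypothetical.

```python
# Four side-chain families, keyed by one-letter amino acid codes.
FAMILIES = {
    "acidic": set("DE"),                # aspartate, glutamate
    "basic": set("KRH"),                # lysine, arginine, histidine
    "non-polar": set("AVLIPFMW"),       # Ala, Val, Leu, Ile, Pro, Phe, Met, Trp
    "uncharged polar": set("GNQCSTY"),  # Gly, Asn, Gln, Cys, Ser, Thr, Tyr
}


def family_of(residue):
    """Return the family name for a one-letter residue code."""
    for name, members in FAMILIES.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(original, replacement):
    """True when two different residues belong to the same family."""
    return (original.upper() != replacement.upper()
            and family_of(original) == family_of(replacement))


print(is_conservative("L", "I"))  # leucine to isoleucine
print(is_conservative("D", "K"))  # aspartate to lysine
```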

It is reasonable to expect that an isolated replacement of a leucine with
an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a
similar replacement of an amino acid with a structurally related amino acid will not
have a major effect on the biological properties of the resulting metastatic marker
protein or polypeptide variant. Properties and functions of metastatic marker protein or
polypeptide variants are of the same type as a metastatic marker protein or polypeptide
comprising amino acid sequences encoded by the nucleotide sequences shown in SEQ
ID NOS:1-85, although the properties and functions of variants can differ in degree.
Whether an amino acid change results in a metastatic marker protein or polypeptide
variant with the appropriate differential expression pattern can readily be determined.
For example, nucleotide probes can be selected from the marker gene sequences
disclosed herein and used to detect marker gene mRNA in Northern blots or in tissue
sections, as is known in the art. Alternatively, antibodies which specifically bind to
protein products of metastatic marker genes can be used to detect expression of
metastatic marker proteins.

Metastatic marker variants include glycosylated forms, aggregative
conjugates with other molecules, and covalent conjugates with unrelated chemical
moieties. Metastatic marker variants also include allelic variants, species variants, and
muteins. Truncations or deletions of regions which do not affect the differential
expression of metastatic marker genes are also metastatic marker variants. Covalent
variants can be prepared by linking functionalities to groups which are found in the
amino acid chain or at the N- or C-terminal residue, as is known in the art.

Full-length metastatic marker proteins can be extracted, using standard
biochemical methods, from metastatic marker protein-producing human cells, such as
metastatic breast or colon cancer cells. An isolated and purified metastatic marker
protein or polypeptide is separated from other compounds which normally associate
with a metastatic marker protein or polypeptide in a cell, such as certain proteins,
carbohydrates, lipids, or subcellular organelles. A preparation of isolated and purified
metastatic marker proteins or polypeptides is at least 80% pure; preferably, the
preparations are 90%, 95%, or 99% pure.

Metastatic marker proteins and polypeptides can also be produced by
recombinant DNA methods or by synthetic chemical methods. For production of
recombinant metastatic marker proteins or polypeptides, coding sequences selected
from the nucleotide sequences shown in SEQ ID NOS:1-85, or variants of those
sequences which encode metastatic marker proteins, can be expressed in known
prokaryotic or eukaryotic expression systems (see below). Bacterial, yeast, insect, or
mammalian expression systems can be used, as is known in the art.

Alternatively, synthetic chemical methods, such as solid phase peptide
synthesis, can be used to synthesize a metastatic marker protein or polypeptide.
General means for the production of peptides, analogs or derivatives are outlined in
Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins - A
Survey of Recent Developments, Weinstein, B. ed., Marcel Dekker, Inc., publ.,
New York (1983). Moreover, substitution of D-amino acids for the normal
L-stereoisomer can be carried out to increase the half-life of the molecule. Metastatic
marker variants can be similarly produced.

Non-naturally occurring fusion proteins comprising at least 6, 8, 10, 12,
15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 120, 140, 160,
180, or 200 or more contiguous metastatic marker amino acids can also be constructed.
Human metastatic marker fusion proteins are useful for generating antibodies against
metastatic marker amino acid sequences and for use in various assay systems. For
example, metastatic marker fusion proteins can be used to identify proteins which
interact with metastatic marker proteins and influence their functions. Physical
methods, such as protein affinity chromatography, or library-based assays for
protein-protein interactions, such as the yeast two-hybrid or phage display systems, can
also be used for this purpose. Such methods are well known in the art and can also be
used as drug screens.

A metastatic marker fusion protein comprises two protein segments
fused together by means of a peptide bond. The first protein segment comprises at least
6, 8, 10, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 120,
140, 160, 180, or 200 or more contiguous amino acids of a metastatic marker protein.
The amino acids can be selected from the amino acid sequences encoded by the
nucleotide sequences shown in SEQ ID NOS:1-85 or from variants of those sequences,
such as those described above. The first protein segment can also comprise a
full-length metastatic marker protein.

The second protein segment can be a full-length protein or a protein
fragment or polypeptide. The fusion protein can be labeled with a detectable marker, as
is known in the art, such as a radioactive, fluorescent, chemiluminescent, or biotinylated
marker. The second protein segment can be an enzyme which will generate a detectable
product, such as β-galactosidase. The first protein segment can be N-terminal or
C-terminal, as is convenient.

Techniques for making fusion proteins, either recombinantly or by
covalently linking two protein segments, are also well known. Recombinant DNA
methods can be used to prepare metastatic marker fusion proteins, for example, by
making a DNA construct which comprises coding sequences selected from SEQ ID
NOS:1-85 in proper reading frame with nucleotides encoding the second protein
segment and expressing the DNA construct in a host cell, as described below.
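
The requirement that the two coding segments be joined "in proper reading frame" can be illustrated with a short check. This is a simplified sketch under stated assumptions, not a cloning protocol: each segment is assumed to be supplied as a coding sequence that already starts in frame, the function name is hypothetical, and linkers, restriction sites, and vector sequences are ignored.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(first_cds, second_cds):
    """Join two coding sequences so the second is read in the first's frame.

    A stop codon ending the first segment is removed so that translation
    continues into the second segment. A ValueError is raised if either
    segment is not a whole number of codons or if the first segment
    contains an internal stop codon.
    """
    first, second = first_cds.upper(), second_cds.upper()
    if len(first) % 3 or len(second) % 3:
        raise ValueError("each coding segment must be a whole number of codons")
    if first[-3:] in STOP_CODONS:
        first = first[:-3]
    codons = [first[i:i + 3] for i in range(0, len(first), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("internal stop codon in first segment")
    return first + second


print(fuse_in_frame("ATGGCCTAA", "GGCTGA"))
```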

Isolated and purified metastatic marker proteins, polypeptides, variants,
or fusion proteins can be used as immunogens, to obtain preparations of antibodies
which specifically bind to a metastatic marker protein. The antibodies can be used,
inter alia, to detect wild-type metastatic marker proteins in human tissue and fractions
thereof. The antibodies can also be used to detect the presence of mutations in
metastatic marker genes which result in under- or over-expression of a metastatic
marker protein or in expression of a metastatic marker protein with altered size or
electrophoretic mobility.

Preparations of polyclonal or monoclonal antibodies can be made using
standard methods. Single-chain antibodies can also be prepared. Single-chain
antibodies which specifically bind to metastatic marker proteins, polypeptides, variants,
or fusion proteins can be isolated, for example, from single-chain immunoglobulin
display libraries, as is known in the art. The library is "panned" against metastatic
marker protein amino acid sequences, and a number of single-chain antibodies which
bind with high affinity to different epitopes of metastatic marker proteins can be
isolated. Hayashi et al., 1995, Gene 160:129-30. Single-chain antibodies can also be
constructed using a DNA amplification method, such as the polymerase chain reaction
(PCR), using hybridoma cDNA as a template. Thirion et al., 1996, Eur. J. Cancer
Prev. 5:507-11.

Single-chain antibodies can be mono- or bispecific, and can be bivalent
or tetravalent. Construction of tetravalent, bispecific single-chain antibodies is taught in
Coloma and Morrison, 1997, Nat. Biotechnol. 15:159-63. Construction of bivalent,
bispecific single-chain antibodies is taught in Mallender and Voss, 1994, J. Biol. Chem.
269:199-206.

A nucleotide sequence encoding the single-chain antibody can be
constructed using manual or automated nucleotide synthesis, cloned into DNA
expression constructs using standard recombinant DNA methods, and introduced into
cells which express the coding sequence, as described below. Alternatively,
single-chain antibodies can be produced directly using, for example, filamentous phage
technology. Verhaar et al., 1995, Int. J. Cancer 61:497-501; Nicholls et al., 1993, J.
Immunol. Meth. 165:81-91.

Metastatic marker-specific antibodies specifically bind to epitopes
present in a full-length metastatic marker protein having an amino acid sequence
encoded by a nucleotide sequence shown in SEQ ID NOS:1-85, to metastatic marker
polypeptides, or to metastatic marker variants, either alone or as part of a fusion protein.
Preferably, metastatic marker epitopes are not present in other human proteins.
Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to form an
epitope. However, epitopes which involve non-contiguous amino acids may require
more, e.g., at least 15, 25, or 50 amino acids.

Antibodies which specifically bind to metastatic marker proteins,
polypeptides, fusion proteins, or variants provide a detection signal at least 5-, 10-, or
20-fold higher than a detection signal provided with other proteins when used in
Western blots or other immunochemical assays. Preferably, antibodies which
specifically bind to metastatic marker epitopes do not detect other proteins in
immunochemical assays and can immunoprecipitate a metastatic marker protein,
polypeptide, fusion protein, or variant from solution.

Antibodies can be purified by methods well known in the art.
Preferably, the antibodies are affinity purified, by passing the antibodies over a column
to which a metastatic marker protein, polypeptide, variant, or fusion protein is bound.
The bound antibodies can then be eluted from the column, for example, using a buffer
with a high salt concentration.

Subgenomic polynucleotides contain less than a whole chromosome.
Preferably, the polynucleotides are intron-free. In a preferred embodiment, the
polynucleotide molecules comprise a contiguous sequence of 15, 20, 25, 30,
3^ 35, 40, 45, 50, 60, 70, 74, 80, 90, 100, 1^ 150, 154, 175, 182, ^,.243^ 
nucleotides selected from SEQ ID NOS:1-85 or the complements thereof. The
complement of a nucleotide sequence shown in SEQ ID NOS:1-85 is a contiguous
nucleotide sequence which forms Watson-Crick base pairs with a contiguous nucleotide
sequence shown in SEQ ID NOS:1-85. The complement of a nucleotide sequence
shown in SEQ ID NOS:1-85 (the antisense strand) is also a subgenomic polynucleotide,
and can be used to provide marker protein antisense oligonucleotides. Double-stranded
polynucleotides which comprise one of the nucleotide sequences shown in SEQ ID
NOS:1-85 are also subgenomic polynucleotides. Metastatic marker protein
subgenomic polynucleotides also include polynucleotides which encode metastatic
marker protein-specific single-chain antibodies and ribozymes, or fusion proteins
comprising metastatic marker protein amino acid sequences.
Degenerate nucleotide sequences encoding amino acid sequences of
metastatic marker proteins or variants, as well as homologous nucleotide sequences
which are at least 85%, 90%, 95%, 98%, or 99% identical to the nucleotide sequences
shown in SEQ ID NOS:1-85, are also metastatic marker subgenomic polynucleotides.
Typically, homologous metastatic marker subgenomic polynucleotide sequences can be
confirmed by hybridization under stringent conditions, as is known in the art. Percent
sequence identity between wild-type and homologous variant sequences is determined
by aligning the wild-type polynucleotide with the variant to obtain the greatest number
of nucleotide matches, as is known in the art, counting the number of nucleotide
matches between the wild-type and the variant, and dividing the total number of
matches by the total number of nucleotides of the wild-type sequence. A preferred
algorithm for calculating percent identity is the Smith-Waterman homology search
algorithm as implemented in the MPSRCH program (Oxford Molecular) using an affine
gap search with the following search parameters: gap open penalty of 1<K and gap
extension penalty of 1.
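
For orientation, a Smith-Waterman local alignment score with affine gap penalties can be computed with the recurrences sketched below. This is a generic textbook formulation (often attributed to Gotoh) and is not the MPSRCH implementation. The match and mismatch scores, the convention that a gap of length k costs `gap_open + (k - 1) * gap_extend`, and the parameter values used in the example call are all assumptions made for illustration.

```python
def smith_waterman_affine(a, b, match=2, mismatch=-1, gap_open=3, gap_extend=1):
    """Return the best local alignment score of a and b with affine gaps.

    H[i][j]: best score of a local alignment ending at a[i-1], b[j-1].
    E[i][j]: best score ending with a gap in a (b[j-1] aligned to a gap).
    F[i][j]: best score ending with a gap in b (a[i-1] aligned to a gap).
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap from H or extend an existing one.
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            score = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0, H[i - 1][j - 1] + score, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


# Eight matching bases bridged by a single one-base gap.
print(smith_waterman_affine("ACGTACGT", "ACGTTACGT"))
```

Raising `gap_open` makes the algorithm prefer an ungapped local match over an alignment that bridges the inserted base, which is the practical effect of the gap parameters recited above.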

Metastatic marker subgenomic polynucleotides can be isolated and
purified free from other nucleotide sequences using standard nucleic acid purification
techniques. For example, restriction enzymes and probes can be used to isolate
polynucleotide fragments which comprise nucleotide sequences encoding a metastatic
marker protein. Isolated and purified subgenomic polynucleotides are in preparations
which are free or at least 90% free of other molecules.

Complementary DNA molecules which encode metastatic marker
proteins can be made using reverse transcriptase, with metastatic marker mRNA as a
template. The polymerase chain reaction (PCR) or other amplification techniques can
be used to obtain metastatic marker subgenomic polynucleotides, using either human
genomic DNA or cDNA as a template, as is known in the art. Alternatively, synthetic
chemistry techniques can be used to synthesize metastatic marker subgenomic
polynucleotides which comprise coding sequences for regions of metastatic marker
proteins, single-chain antibodies, or ribozymes, or which comprise antisense
oligonucleotides. The degeneracy of the genetic code allows alternate nucleotide
sequences to be synthesized which will encode a metastatic marker protein comprising
amino acid sequences encoded by the nucleotide sequences shown in SEQ ID NOS:1-85.
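
The degeneracy referred to above can be made concrete by counting, for a short peptide, how many distinct nucleotide sequences encode it under the standard genetic code. The sketch below is illustrative only; the function names are hypothetical and the peptides in the example are arbitrary.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid (one-letter code) to the codons that encode it.
CODONS_FOR = {}
for codon, aa in zip(product(BASES, repeat=3), AMINO):
    CODONS_FOR.setdefault(aa, []).append("".join(codon))


def count_encodings(peptide):
    """Number of distinct nucleotide sequences encoding the peptide."""
    total = 1
    for aa in peptide.upper():
        total *= len(CODONS_FOR[aa])
    return total


def enumerate_encodings(peptide):
    """List every nucleotide sequence encoding a (short) peptide."""
    choices = [CODONS_FOR[aa] for aa in peptide.upper()]
    return ["".join(codons) for codons in product(*choices)]


print(count_encodings("MW"))    # methionine and tryptophan have one codon each
print(count_encodings("LSR"))   # leucine, serine, arginine have six codons each
print(enumerate_encodings("MF"))
```

The count grows multiplicatively with peptide length, so enumeration is practical only for short peptides; for longer sequences only the count is useful.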

Purified and isolated metastatic marker subgenomic polynucleotides can
be used as primers to obtain additional copies of the polynucleotides or as probes for
identifying wild-type and mutant metastatic marker protein coding sequences.
Metastatic marker subgenomic polynucleotides can be used to express metastatic
marker mRNA, protein, polypeptides, or fusion proteins and to generate metastatic
marker antisense oligonucleotides and ribozymes.

A metastatic marker subgenomic polynucleotide comprising metastatic
marker protein coding sequences can be used in an expression construct. Preferably, the
metastatic marker subgenomic polynucleotide is inserted into an expression plasmid
(for example, the Ecdyson system, pIND, In Vitro Gene). Metastatic marker
subgenomic polynucleotides can be propagated in vectors and cell lines using
techniques well known in the art. Metastatic marker subgenomic polynucleotides can
be on linear or circular molecules. They can be on autonomously replicating molecules
or on molecules without replication sequences. They can be regulated by their own or
by other regulatory sequences.

A host cell comprising a metastatic marker expression construct can
then be used to express all or a portion of a metastatic marker protein. Host cells
comprising metastatic marker expression constructs can be prokaryotic or eukaryotic.
A variety of host cells are available for use in bacterial, yeast, insect, and human
expression systems and can be used to express or to propagate metastatic marker
expression constructs (see below). Expression constructs can be introduced into host
cells using any technique known in the art. These techniques include
transferrin-polycation-mediated DNA transfer, transfection with naked or encapsulated
nucleic acids, liposome-mediated cellular fusion, intracellular transportation of
DNA-coated latex beads, protoplast fusion, viral infection, electroporation, and calcium
phosphate-mediated transfection.

A metastatic marker expression construct comprises a promoter which is
functional in a chosen host cell. The skilled artisan can readily select an appropriate
promoter from the large number of cell type-specific promoters known and used in the
art. The expression construct can also contain a transcription terminator which is
functional in the host cell. The expression construct comprises a polynucleotide
segment which encodes all or a portion of the metastatic marker protein, variant, fusion
protein, antibody, or ribozyme. The polynucleotide segment is located downstream
from the promoter. Transcription of the polynucleotide segment initiates at the
promoter. The expression construct can be linear or circular and can contain sequences,
if desired, for autonomous replication.

Bacterial systems for expressing metastatic marker expression constructs
include those described in Chang et al., Nature (1979) 275: 615, Goeddel et al., Nature
(1979) 281: 544, Goeddel et al., Nucleic Acids Res. (1980) 8: 4057, EP 36,776, U.S.
4,551,433, deBoer et al., Proc. Nat'l Acad. Sci. USA (1983) 80: 21-25, and Siebenlist et
al., Cell (1980) 20: 269.

Expression systems in yeast include those described in Hinnen et al.,
Proc. Nat'l Acad. Sci. USA (1978) 75: 1929; Ito et al., J. Bacteriol. (1983) 153: 163;
Kurtz et al., Mol. Cell. Biol. (1986) 6: 142; Kunze et al., J. Basic Microbiol. (1985) 25:
141; Gleeson et al., J. Gen. Microbiol. (1986) 132: 3459, Roggenkamp et al., Mol. Gen.
Genet. (1986) 202: 302; Das et al., J. Bacteriol. (1984) 158: 1165; De Louvencourt et
al., J. Bacteriol. (1983) 154: 737, Van den Berg et al., Bio/Technology (1990) 8: 135;
Kunze et al., J. Basic Microbiol. (1985) 25: 141; Cregg et al., Mol. Cell. Biol. (1985) 5:
3376, U.S. 4,837,148, US 4,929,555; Beach and Nurse, Nature (1981) 300: 706;
Davidow et al., Curr. Genet. (1985) 10: 380, Gaillardin et al., Curr. Genet. (1985) 10:
49, Ballance et al., Biochem. Biophys. Res. Commun. (1983) 112: 284-289; Tilburn et
al., Gene (1983) 26: 205-221, Yelton et al., Proc. Nat'l Acad. Sci. USA (1984) 81:
1470-1474, Kelly and Hynes, EMBO J. (1985) 4: 475-479; EP 244,234, and WO
91/00357.

Expression of metastatic marker expression constructs in insects can be
carried out as described in U.S. 4,745,051, Friesen et al. (1986) "The Regulation of
Baculovirus Gene Expression" in: The Molecular Biology of Baculoviruses (W.
Doerfler, ed.), EP 127,839, EP 155,476, and Vlak et al., J. Gen. Virol. (1988) 69: 765-
776, Miller et al., Ann. Rev. Microbiol. (1988) 42: 177, Carbonell et al., Gene (1988)
73: 409, Maeda et al., Nature (1985) 315: 592-594, Lebacq-Verheyden et al., Mol. Cell.
Biol. (1988) 8: 3129; Smith et al., Proc. Nat'l Acad. Sci. USA (1985) 82: 8404,
Miyajima et al., Gene (1987) 58: 273; and Martin et al., DNA (1988) 7: 99. Numerous
baculoviral strains and variants and corresponding permissive insect host cells from
hosts are described in Luckow et al., Bio/Technology (1988) 6: 47-55, Miller et al., in
Genetic Engineering (Setlow, J.K. et al. eds.), Vol. 8 (Plenum Publishing, 1986), pp.
277-279, and Maeda et al., Nature (1985) 315: 592-594.
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Mammalian expression of metastatic marker expression constructs can 
be achieved as described in Dijkema et aL; EMBQ J. (1985) 4: 761, Gorman et ai t 
Proc. Natl Acad Sci. USA (1982b) 79: 6777, Boshart et aL Cell (1985) 41: 521 and 
; . U.S.* 4,3P^ expression of metastatic marker 

5 expression constructs can be facilitated as described in Ham and Wallace, Meth Em; 
; (1979) 58: 44, Barnes and Sato, Anal Biochem. (1980) 102: 255 ? US. 4,767,704, US 
4,657,866, US 4,927,762, US, 4,^60,655, WO 90/103430, WO 87/00195, and U S. RE 
30,985. . . .. r , v ;r ^ , ? , _ 

Subgenomic polynucleotides of the invention can also be used in gene
delivery vehicles, for the purpose of delivering a metastatic marker mRNA or
oligonucleotide (either with the sequence of native metastatic marker mRNA or its
complement), full-length metastatic marker protein, metastatic marker fusion protein,
metastatic marker polypeptide, or metastatic marker-specific ribozyme or single-chain
antibody, into a cell, preferably a eukaryotic cell. According to the present invention, a
gene delivery vehicle can be, for example, naked plasmid DNA, a viral expression
vector comprising a metastatic marker subgenomic polynucleotide, or a metastatic
marker subgenomic polynucleotide in conjunction with a liposome or a condensing
agent.

In one embodiment of the invention, the gene delivery vehicle comprises
a promoter and a metastatic marker subgenomic polynucleotide. Preferred promoters
are tissue-specific promoters and promoters which are activated by cellular
proliferation, such as the thymidine kinase and thymidylate synthase promoters. Other
preferred promoters include promoters which are activatable by infection with a virus,
such as the α- and β-interferon promoters, and promoters which are activatable by a
hormone, such as estrogen. Other promoters which can be used include the Moloney
virus LTR, the CMV promoter, and the mouse albumin promoter.

A metastatic marker gene delivery vehicle can comprise viral sequences
such as a viral origin of replication or packaging signal. These viral sequences can be
selected from viruses such as astrovirus, coronavirus, orthomyxovirus, papovavirus,
paramyxovirus, parvovirus, picornavirus, poxvirus, retrovirus, togavirus or adenovirus.
In a preferred embodiment, the metastatic marker gene delivery vehicle is a
recombinant retroviral vector. Recombinant retroviruses and various uses thereof have
been described in numerous references including, for example, Mann et al., Cell
33:153, 1983, Cane and Mulligan, Proc. Nat'l Acad. Sci. USA 81:6349, 1984, Miller et
al., Human Gene Therapy 1:5-14, 1990, U.S. Patent Nos. 4,405,712, 4,861,719, and
4,980,289, and PCT Application Nos. WO 89/02,468, WO 89/05,349, and WO
90/02,806. Numerous retroviral gene delivery vehicles can be utilized in the present
invention, including for example those described in EP 0,415,731; WO 90/07936; WO
94/03622; WO 93/25698; WO 93/25234; U.S. Patent No. 5,219,740; WO 9311230;
WO 9310218; Vile and Hart, Cancer Res. 53:3860-3864, 1993; Vile and Hart, Cancer
Res. 53:962-967, 1993; Ram et al., Cancer Res. 53:83-88, 1993; Takamiya et al., J.
Neurosci. Res. 33:493-503, 1992; Baba et al., J. Neurosurg. 79:729-735, 1993 (U.S.
Patent No. 4,777,127, GB 2,200,651, EP 0,345,242 and WO 91/02805).
Particularly preferred retroviruses are derived from retroviruses which
include avian leukosis virus (ATCC Nos. VR-535 and VR-247), bovine leukemia virus
(VR-1315), murine leukemia virus (MLV), mink-cell focus-inducing virus (Koch et al.,
J. Vir. 49:828, 1984; and Oliff et al., J. Vir. 48:542, 1983), murine sarcoma virus
(ATCC Nos. VR-844, 45010 and 45016), reticuloendotheliosis virus (ATCC Nos. VR-994,
VR-770 and 45011), Rous sarcoma virus, Mason-Pfizer monkey virus, baboon
endogenous virus, endogenous feline retrovirus (e.g., RD114), and mouse or rat gL30
sequences used as a retroviral vector. Particularly preferred strains of MLV from which
recombinant retroviruses can be generated include 4070A and 1504A (Hartley and
Rowe, J. Vir. 19:19, 1976), Abelson (ATCC No. VR-999), Friend (ATCC No. VR-245),
Graffi (Ru et al., J. Vir. 67:4722, 1993; and Yantchev, Neoplasma 26:397, 1979),
Gross (ATCC No. VR-590), Kirsten (Albino et al., J. Exp. Med. 164:1710, 1986),
Harvey sarcoma virus (Manly et al., J. Vir. 62:3540, 1988; and Albino et al., J. Exp.
Med. 164:1710, 1986) and Rauscher (ATCC No. VR-998), and Moloney MLV (ATCC
No. VR-190). A particularly preferred non-mouse retrovirus is Rous sarcoma virus.
Preferred Rous sarcoma viruses include Bratislava (Manly et al., J. Vir. 62:3540, 1988;
and Albino et al., J. Exp. Med. 164:1710, 1986), Bryan high titer (e.g., ATCC Nos. VR-334,
VR-657, VR-726, VR-659, and VR-728), Bryan standard (ATCC No. VR-140),
Carr-Zilber (Adgighitov et al., Neoplasma 27:159, 1980), Engelbreth-Holm (Laurent et
al., Biochem. Biophys. [illegible]), Harris, Prague (e.g., ATCC Nos. VR-772,
and 45033).

Any of the above retroviruses can be readily utilized in order to assemble
or construct retroviral metastatic marker gene delivery vehicles given the disclosure
provided herein and standard recombinant techniques (e.g., Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, 1989,
and Kunkle, PNAS 82:488, 1985) known in the art. Portions of retroviral metastatic
marker expression vectors can be derived from different retroviruses. For example,
retrovector LTRs can be derived from a murine sarcoma virus, a tRNA binding site
from a Rous sarcoma virus, a packaging signal from a murine leukemia virus, and an
origin of second strand synthesis from an avian leukosis virus. These recombinant
retroviral vectors can be used to generate transduction competent retroviral vector
particles by introducing them into appropriate packaging cell lines (see Serial No.
07/800,921, filed November 29, 1991). Recombinant retroviruses can be produced
which direct the site-specific integration of the recombinant retroviral genome into
specific regions of the host cell DNA. Such site-specific integration can be mediated by
a chimeric integrase incorporated into the retroviral particle (see Serial No. 08/445,466
filed May 22, 1995). It is preferable that the recombinant viral gene delivery vehicle is
a replication-defective recombinant virus.

Packaging cell lines suitable for use with the above-described retroviral
gene delivery vehicles can be readily prepared (see Serial No. 08/240,030, filed May 9,
1994; see also WO 92/05266) and used to create producer cell lines (also termed vector
cell lines or "VCLs") for production of recombinant viral particles. In particularly
preferred embodiments of the present invention, packaging cell lines are made from
human (e.g., HT1080 cells) or mink parent cell lines, thereby allowing production of
recombinant retroviral gene delivery vehicles which are capable of surviving
inactivation in human serum. The construction of recombinant retroviral gene delivery
vehicles is described in detail in WO 91/02805. These recombinant retroviral gene
delivery vehicles can be used to generate transduction competent retroviral particles by
introducing them into appropriate packaging cell lines (see Serial No. 07/800,921).
Similarly, adenovirus gene delivery vehicles can also be readily prepared and utilized
given the disclosure provided herein (see also Berkner, Biotechniques 6:616-627, 1988,
and Rosenfeld et al., Science 252:431-434, 1991, WO 93/07283, WO 93/06223, and
WO 93/07282).

A metastatic marker gene delivery vehicle can also be a recombinant
adenoviral gene delivery vehicle. Such vehicles can be readily prepared and utilized
given the disclosure provided herein (see Berkner, Biotechniques 6:616, 1988, and
Rosenfeld et al., Science 252:431, 1991, WO 93/07283, WO 93/06223, and WO
93/07282). Adeno-associated viral metastatic marker gene delivery vehicles can also be
constructed and used to deliver metastatic marker amino acids or nucleotides. The use
of adeno-associated viral gene delivery vehicles in vitro is described in Chatterjee et al.,
Science 258: 1485-1488 (1992), Walsh et al., Proc. Nat'l Acad. Sci. 89: 7257-7261
(1992), Walsh et al., J. Clin. Invest. 94: 1440-1448 (1994), Flotte et al., J. Biol. Chem.
268: 3781-3790 (1993), Ponnazhagan et al., J. Exp. Med. 179: 733-738 (1994), Miller et
al., Proc. Nat'l Acad. Sci. 91: 10183-10187 (1994), [illegible]
343, [illegible] Exp. Hematol. 23: 1261-1267 (1995), and Zhou et al., Gene
Therapy 3: 223-229 (1996). In vivo use of these vehicles is described in Flotte et al.,
Proc. Nat'l Acad. Sci. 90: 10613-10617 (1993), and Kaplitt et al., Nature Genet. 8:148-153
(1994).
In another embodiment of the invention, a metastatic marker gene
delivery vehicle is derived from a togavirus. Preferred togaviruses include alphaviruses,
in particular those described in U.S. Serial No. 08/405,627, filed March 15, 1995, WO
95/07994. Alpha viruses, including Sindbis and ELVS viruses, can be gene delivery
vehicles for metastatic marker polynucleotides. Alpha viruses are described in WO
94/21792, WO 92/10578 and WO 95/07994. Several different alphavirus gene delivery
vehicle systems can be constructed and used to deliver metastatic marker subgenomic
polynucleotides to a cell according to the present invention. Representative examples
of such systems include those described in U.S. Patents 5,091,309 and 5,217,879.
Particularly preferred alphavirus gene delivery vehicles for use in the present invention
include those which are described in WO 95/07994, and U.S. Serial No. 08/405,627.

Preferably, the recombinant viral vehicle is a recombinant alphavirus
viral vehicle based on a Sindbis virus. Sindbis constructs, as well as numerous similar
constructs, can be readily prepared essentially as described in U.S. Serial No.
08/198,450. Sindbis viral gene delivery vehicles typically comprise a 5' sequence
capable of initiating Sindbis virus transcription, a nucleotide sequence encoding Sindbis
non-structural proteins, a viral junction region inactivated so as to prevent subgenomic
fragment transcription, and a Sindbis RNA polymerase recognition sequence.
Optionally, the viral junction region can be modified so that subgenomic polynucleotide
transcription is reduced, increased, or maintained. As will be appreciated by those in
the art, corresponding regions from other alphaviruses can be used in place of those
described above.

The viral junction region of an alphavirus-derived gene delivery vehicle
can comprise a first viral junction region which has been inactivated in order to prevent
transcription of the subgenomic polynucleotide and a second viral junction region
which has been modified such that subgenomic polynucleotide transcription is reduced.
An alphavirus-derived vehicle can also include a 5' promoter capable of initiating
synthesis of viral RNA from cDNA and a 3' sequence which controls transcription
termination.

Other recombinant togaviral gene delivery vehicles which can be utilized
in the present invention include those derived from Semliki Forest virus (ATCC VR-67;
ATCC VR-1247), Middleberg virus (ATCC VR-370), Ross River virus (ATCC VR-373;
ATCC VR-1246), Venezuelan equine encephalitis virus (ATCC VR-923; ATCC
VR-1250; ATCC VR-1249; ATCC VR-532), and those described in U.S. Patents
5,091,309 and 5,217,879 and in WO 92/10578. The Sindbis vehicles described above,
as well as numerous similar constructs, can be readily prepared essentially as described
in U.S. Serial No. 08/198,450.

Other viral gene delivery vehicles suitable for use in the present
invention include, for example, those derived from poliovirus (Evans et al., Nature
339:385, 1989, and Sabin et al., J. Biol. Standardization 1:115, 1973) (ATCC VR-58);
rhinovirus (Arnold et al., J. Cell. Biochem. L401, 1990) (ATCC VR-1110); pox viruses,
such as canary pox virus or vaccinia virus (Fisher-Hoch et al., PNAS 86:317, 1989;
Flexner et al., Ann. N.Y. Acad. Sci. 569:86, 1989; Flexner et al., Vaccine 8:17, 1990;
U.S. 4,603,112 and U.S. 4,769,330; WO 89/01973) (ATCC VR-111; ATCC VR-2010);
SV40 (Mulligan et al., Nature 277:108, 1979) (ATCC VR-305), (Madzak et al., J. Gen.
Vir. 73:1533, 1992); influenza virus (Luytjes et al., Cell 59:1107, 1989; McMicheal et
al., The New England Journal of Medicine 309:13, 1983; and Yap et al., Nature
273:238, 1978) (ATCC VR-797); parvovirus such as adeno-associated virus (Samulski
et al., J. Vir. 63:3822, 1989, and Mendelson et al., Virology 166:154, 1988) (ATCC
VR-645); herpes simplex virus (Kit et al., Adv. Exp. Med. Biol. 215:219, 1989) (ATCC
VR-977; ATCC VR-260); Nature 277: 108, 1979); human immunodeficiency virus
(EPO 386,882, Buchschacher et al., J. Vir. 66:2731, 1992); measles virus (EPO
440,219) (ATCC VR-24); A (ATCC VR-67; ATCC VR-1247), Aura (ATCC VR-368),
Bebaru virus (ATCC VR-600; ATCC VR-1240), Cabassou (ATCC VR-922),
Chikungunya virus (ATCC VR-64; ATCC VR-1241), Fort Morgan (ATCC VR-924),
Getah virus (ATCC VR-369; ATCC VR-1243), Kyzylagach (ATCC VR-927), Mayaro
(ATCC [illegible]), Mucambo virus (ATCC VR-580; ATCC VR-1244), Ndumu (ATCC
VR-371), Pixuna virus (ATCC VR-372; ATCC VR-1245), Tonate (ATCC VR-925),
Triniti (ATCC VR-469), Una (ATCC VR-374), Whataroa (ATCC VR-926), Y-62-33
(ATCC VR-375), O'Nyong virus, Eastern encephalitis virus (ATCC VR-65; ATCC
VR-1242), Western encephalitis virus (ATCC VR-70; ATCC VR-1251; ATCC VR-622;
ATCC VR-1252), and coronavirus (Hamre et al., Proc. Soc. Exp. Biol. Med. 121:190,
1966) (ATCC VR-740).

A subgenomic metastatic marker polynucleotide of the invention can
also be combined with a condensing agent to form a gene delivery vehicle. In a
preferred embodiment, the condensing agent is a polycation, such as polylysine,
polyarginine, polyornithine, protamine, spermine, spermidine, and putrescine. Many
suitable methods for making such linkages are known in the art (see, for example, Serial
No. 08/366,787, filed December 30, 1994).

In an alternative embodiment, a metastatic marker subgenomic
polynucleotide is associated with a liposome to form a gene delivery vehicle.
Liposomes are small, lipid vesicles comprised of an aqueous compartment enclosed by
a lipid bilayer, typically spherical or slightly elongated structures several hundred
Angstroms in diameter. Under appropriate conditions, a liposome can fuse with the
plasma membrane of a cell or with the membrane of an endocytic vesicle within a cell
which has internalized the liposome, thereby releasing its contents into the cytoplasm.
Prior to interaction with the surface of a cell, however, the liposome membrane acts as a
relatively impermeable barrier which sequesters and protects its contents, for example,
from degradative enzymes. Additionally, because a liposome is a synthetic structure,
specially designed liposomes can be produced which incorporate desirable features.
See Stryer, Biochemistry, pp. 236-249, 1975 (W.H. Freeman, San Francisco, CA);
[illegible], 1980; Bayer et al., Biochim. Biophys. Acta
550:464, 1979; Rivnay et al., Meth. Enzymol. [illegible], 1987; Wang et al., PNAS 84:
7851, 1987, Plant et al., Anal. Biochem. 176:420, 1989, and U.S. Patent 4,762,915.
Liposomes can encapsulate a variety of nucleic acid molecules including DNA, RNA,
plasmids, and expression constructs comprising metastatic marker subgenomic
polynucleotides such as those disclosed in the present invention.
Liposomal preparations for use in the present invention include cationic
(positively charged), anionic (negatively charged) and neutral preparations. Cationic
liposomes have been shown to mediate delivery of plasmid DNA (Felgner
et al., Proc. Nat'l Acad. Sci. USA 84:7413-7416, 1987), mRNA (Malone et al., Proc.
Nat'l Acad. Sci. [illegible], 1989), and purified transcription factors (Debs et
al., J. Biol. Chem. 265:10189-10192, 1990), in functional form. Cationic liposomes are
readily available. For example, N-[1-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium
(DOTMA) liposomes are available under the trademark Lipofectin, from GIBCO BRL,
Grand Island, NY. See also Felgner et al., Proc. Nat'l Acad. Sci. USA 91:5148-5152,
1994. Other commercially available liposomes include Transfectace
(DDAB/DOPE) and DOTAP/DOPE (Boerhinger). Other cationic liposomes can be
prepared from readily available materials using techniques well known in the art. See,
e.g., Szoka et al., Proc. Nat'l Acad. Sci. USA 75:4194-4198, 1978; and WO 90/11092
for descriptions of the synthesis of DOTAP (1,2-bis(oleoyloxy)-3-
(trimethylammonio)propane) liposomes.

Similarly, anionic and neutral liposomes are readily available, such as
from Avanti Polar Lipids (Birmingham, AL), or can be easily prepared using readily
available materials. Such materials include phosphatidyl choline, cholesterol,
phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC),
dioleoylphosphatidyl glycerol (DOPG), dioleoylphosphatidyl ethanolamine (DOPE),
among others. These materials can be mixed with the DOTMA and DOTAP
starting materials. Methods for making liposomes using these
materials are well known in the art.

The liposomes can comprise multilamellar vesicles (MLVs), small
unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various
liposome-nucleic acid complexes are prepared using methods known in the art. See,
e.g., Straubinger et al., METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 512-527;
[illegible] Biochim. Biophys. Acta 394:483, [illegible]; [illegible] Cell 17:77, 1979;
Deamer and Bangham, Biochim. Biophys. Acta 443:629, 1976; Ostro et al., Biochem.
Biophys. Res. Commun. [illegible], 1979; Enoch
and Strittmatter, Proc. Nat'l Acad. Sci. USA 76:145, 1979; Fraley et al., J. Biol. Chem.
[illegible]; Szoka [illegible], Proc. Nat'l Acad. Sci. USA 75:145,
1979; and Schaefer-Ridder et al., Science 215:166, 1982.

In addition, lipoproteins can be included with a metastatic marker
subgenomic polynucleotide for delivery to a cell. Examples of such lipoproteins
include chylomicrons, HDL, IDL, LDL, and VLDL. Mutants, fragments, or fusions of
these proteins can also be used. Modifications of naturally occurring lipoproteins can
also be used, such as acetylated LDL. These lipoproteins can target the delivery of
polynucleotides to cells expressing lipoprotein receptors. Preferably, if lipoproteins are
included with a polynucleotide, no other targeting ligand is included in the composition.

In another embodiment, naked metastatic marker subgenomic
polynucleotide molecules are used as gene delivery vehicles, as described in WO
90/11092 and U.S. Patent 5,580,859. Such gene delivery vehicles can be either
metastatic marker DNA or RNA and, in certain embodiments, are linked to killed
adenovirus. Curiel et al., Hum. Gene Ther. 3:147-154, 1992. Other suitable vehicles
include DNA-ligand (Wu et al., J. Biol. Chem. 264:16985-16987, 1989), lipid-DNA
combinations (Felgner et al., Proc. Nat'l Acad. Sci. USA 84:7413-7417, 1989),
liposomes (Wang et al., [illegible], 1987) and
microprojectiles [citation illegible].

One can increase the efficiency of naked metastatic marker subgenomic
polynucleotide uptake into cells by coating the polynucleotides onto biodegradable
latex beads. This approach takes advantage of the observation that latex beads, when
incubated with cells in culture, are efficiently transported and concentrated in the
perinuclear region of the cells. The beads will then be transported into cells when
injected into muscle. Metastatic marker subgenomic polynucleotide-coated latex beads
will be efficiently transported into cells after endocytosis is initiated by the latex beads
and thus increase gene transfer and expression efficiency. This method can be
improved further by treating the beads to increase their hydrophobicity, thereby
facilitating the disruption of the endosome and release of metastatic marker
subgenomic polynucleotides into the cytoplasm.

The invention provides a method of detecting metastatic marker gene
expression in a biological sample. Detection of metastatic marker gene expression is
useful, for example, for identifying metastases or for determining metastatic potential in
a tissue sample, preferably a tumor. Appropriate treatment regimens can then be
designed for patients who are at risk for developing metastatic cancers in other organs
of the body.

The body sample can be, for example, a solid tissue or a fluid sample.
Protein or nucleic acid expression products can be detected in the body sample. In one
embodiment, the body sample is assayed for the presence of a metastatic marker
protein. A metastatic marker protein comprises a sequence encoded by a nucleotide
sequence shown in SEQ ID NOS:1-85 or its complement and can be detected using the
marker protein-specific antibodies of the present invention. The antibodies can be
labeled, for example, with a radioactive, fluorescent, biotinylated, or enzymatic tag and
detected directly, or can be detected using indirect immunochemical methods, using a
labeled secondary antibody. The presence of the metastatic marker proteins can be
assayed, for example, in tissue sections by immunocytochemistry, or in lysates, using
Western blotting, as is known in the art.

In another embodiment, the body sample is assayed for the presence of
marker protein mRNA. A sample can be contacted with a nucleic acid hybridization
probe capable of hybridizing with the mRNA corresponding to the selected polypeptide.
Still further, the sample can be subjected to a Northern blotting technique to detect
mRNA, indicating expression of the polypeptide. For those techniques in which mRNA
is detected, the sample can be subjected to a nucleic acid amplification process whereby
the mRNA molecule or a selected part thereof is amplified using appropriate nucleotide
primers. Other RNA detection techniques can also be used, including, but not limited
to, in situ hybridization.

Marker protein-specific probes can be generated using the cDNA
sequences disclosed in SEQ ID NOS:1-85. The probes are preferably at least 15 to 50
nucleotides in length, although they can be at least 8, 10, 11, 12, 20, 25, 30, 35, 40, 45,
60, 75, or 100 or more nucleotides in length. The probes can be synthesized chemically
or can be generated from longer polynucleotides using restriction enzymes. The probes
can be labeled, for example, with a radioactive, biotinylated, or fluorescent tag.
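
As an illustration only, candidate probes within the preferred length range can be enumerated from a cDNA sequence with a sliding window. The sketch below is in Python; the function name `candidate_probes` and the short example sequence are hypothetical and are not taken from SEQ ID NOS:1-85.

```python
def candidate_probes(cdna: str, length: int = 20, step: int = 1) -> list[str]:
    """Return every window of `length` nucleotides taken from `cdna`.

    `length` defaults to 20, inside the preferred 15 to 50 nucleotide range.
    This only enumerates subsequences; it does not score specificity.
    """
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    seq = cdna.upper().replace(" ", "")
    # The last valid window starts at len(seq) - length.
    return [seq[i:i + length] for i in range(0, len(seq) - length + 1, step)]


# Hypothetical 21-nucleotide cDNA fragment, used only for illustration.
example_cdna = "ATGGCCATTGTAATGGGCCGC"
probes = candidate_probes(example_cdna, length=15)
```

In practice each candidate would still need to be checked for specificity against other transcripts before use as a hybridization probe.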

The level of an expression product in a body sample can be quantitated.
Quantitation can be accomplished, for example,
by comparing the level of expression product detected in the body sample with the
amounts of product present in a standard curve. A comparison can be made visually or
using a technique such as densitometry, with or without computerized assistance. For
use as controls, body samples can be isolated from other humans, other non-cancerous
organs of the patient being tested, or non-metastatic breast or colon cancer from the
patient being tested.

Polynucleotides encoding metastatic marker-specific reagents of the
invention, such as antibodies and nucleotide probes, can be supplied in a kit for
detecting marker gene expression products in a biological sample. The kit can also
contain buffers or labeling components, as well as instructions for using the reagents to
detect the marker expression products in the biological sample.
If expression of a metastatic marker gene having a nucleotide sequence
shown in SEQ ID NOS:[illegible], 4, 9, 13, 14, 19, 26, 29, [illegible], 48, 55, 57, 60,
[illegible], 64, 72, 73, 82, or 83 is detected, the biological sample contains cancer cells
which will likely metastasize to the lung. If expression of a metastatic marker gene
having a nucleotide sequence shown in SEQ ID NOS:1, 5, 11, 18, 20, 22, 24, 30, 33, 35,
36, 38, 45, 52, 58, 65, 66, 70, 74, 76, or 80 is detected, the biological sample contains
cancer cells which will likely metastasize to the bone and/or lung. On the other hand, if
expression of a metastatic marker gene having a nucleotide sequence shown in SEQ ID
NOS:3, 7, 8, 10, 12, 15-17, 21, 23, 25, 28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61,
62, 67, 68, 75, 77-79, 81, 84, or 85 is detected, the biological sample contains cancer
cells which will likely not metastasize. Detection of expression of a metastatic marker
gene comprising the nucleotide sequence shown in SEQ ID NO:56 also indicates that the
biological sample contains cancer cells which will likely metastasize. This information
can be used, for example, to design treatment regimens. Treatment regimens can include
altering expression of one or more metastatic marker genes, as desired. Metastatic
marker gene expression can be altered for therapeutic purposes, as described below, or
can be used to identify therapeutic agents.

In one embodiment of the invention, expression of a metastatic marker
gene whose expression is up-regulated in metastatic cancer is decreased using a
ribozyme, an RNA molecule with catalytic activity. See, e.g., Cech, 1987, Science 236:
1532-1539; Cech, 1990, Ann. Rev. Biochem. 59:543-568; Cech, 1992, Curr. Opin.
Struct. Biol. 2: 605-609; Couture and Stinchcomb, 1996, Trends Genet. 12: 510-515.
Ribozymes can be used to inhibit gene function by cleaving an RNA sequence, as is
known in the art.

Coding sequences of metastatic marker genes can be used to generate
ribozymes which will specifically bind to mRNA transcribed from a metastatic marker
gene. Methods of designing and constructing ribozymes which can cleave other RNA
molecules in trans in a highly sequence specific manner have been developed and
described in the art (see Haseloff [illegible]). For example,
the cleavage activity of ribozymes can be targeted to specific RNAs by engineering a
discrete "hybridization" region into the ribozyme. The hybridization region contains a
sequence complementary to the target RNA and thus specifically hybridizes with the
target (see, for example, Gerlach, W. L. et al., EP 321,201). Longer complementary
sequences can be used to increase the affinity of the hybridization sequence for the
target. The hybridizing and cleavage regions of the ribozyme can be integrally related;
thus, upon hybridizing to the target RNA through the complementary regions, the
catalytic region of the ribozyme can cleave the target.

Ribozymes can be introduced into cells as part of a DNA construct, as is
known in the art. The DNA construct can also include transcriptional regulatory
elements, such as a promoter element, an enhancer or UAS element, and a
transcriptional terminator signal, for controlling the transcription of the ribozyme in the
cells.

Mechanical methods, such as microinjection, liposome-mediated
transfection, electroporation, or calcium phosphate precipitation, can be used to
introduce a ribozyme-containing DNA construct into cells whose division it is desired
to decrease, as described above. Alternatively, if it is desired that a DNA construct be
stably retained by the cells, the DNA construct can be supplied on a plasmid and
maintained as a separate element or integrated into the genome of the cells, as is known
in the art.

As taught in Haseloff et al., U.S. 5,641,673, ribozymes can be
engineered so that their expression will occur in response to factors which induce
expression of metastatic marker genes. Ribozymes can also be engineered to provide an
additional level of regulation, so that destruction of mRNA occurs only when both a
ribozyme and a metastatic marker gene are expressed in the cells.

Expression of a metastatic marker gene can also be altered using an
antisense oligonucleotide sequence. The antisense sequence is complementary to at
least a portion of the coding sequence of a metastatic marker gene having a nucleotide
sequence shown in SEQ ID NOS:1-85. The complement of a nucleotide sequence
shown in SEQ ID NOS:1-85 is a contiguous sequence of nucleotides which form
Watson-Crick basepairs with a contiguous nucleotide sequence shown in SEQ ID NOS:
1-85.
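
The Watson-Crick complement described above can be sketched in a few lines of Python. This is a minimal illustration, assuming a DNA antisense strand written 5' to 3'; the function name `reverse_complement` and the example target are hypothetical and are not sequences from SEQ ID NOS:1-85.

```python
# Map each base to its Watson-Crick partner (DNA output; U in the input pairs with A).
_COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")


def reverse_complement(seq: str) -> str:
    """Return the DNA strand that base-pairs with `seq`, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


# Hypothetical 10-nucleotide stretch of a coding sequence.
target = "ATGGCCATTG"
antisense = reverse_complement(target)
```

Applying the function twice to a DNA sequence returns the original sequence, which is a quick sanity check on any antisense design.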

Preferably, the antisense oligonucleotide sequence is at least six
nucleotides in length, but can be at least about 8, 12, 15, 20, 25, 30, 35, 40, 45, or 50
nucleotides long. Longer sequences can also be used. Antisense oligonucleotide
molecules can be provided in a DNA construct and introduced into cells whose division
is to be decreased.

Antisense oligonucleotides can comprise deoxyribonucleotides,
ribonucleotides, or a combination of both. Oligonucleotides can be synthesized
manually or by an automated synthesizer, by covalently linking the 5' end of one
nucleotide with the 3' end of another nucleotide with non-phosphodiester
internucleotide linkages such as alkylphosphonates, phosphorothioates,
phosphorodithioates, alkylphosphonothioates, alkylphosphonates, phosphoramidates,
phosphate esters, carbamates, acetamidate, carboxymethyl esters, carbonates, and
phosphate triesters. See Brown, 1994, Meth. Mol. Biol. 20:1-8; Sonveaux, 1994, Meth.
Mol. Biol. [illegible]; Uhlmann et al., 1990, Chem. Rev. 90:543-583.

Although precise complementarity is not required for successful duplex
formation between an antisense molecule and the complementary coding sequence of a
metastatic marker gene, antisense molecules with no more than one mismatch are
preferred. One skilled in the art can easily use the calculated melting point of a
metastatic marker gene antisense-sense pair to determine the degree of mismatching
which will be tolerated between a particular antisense oligonucleotide and a particular
metastatic marker gene coding sequence.
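
As a rough illustration of the melting point calculation mentioned above, the sketch below counts mismatches in an antisense-sense pair and estimates a melting temperature with the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius. The Wallace rule is a swapped-in approximation for short oligonucleotides (roughly 14 to 20 nucleotides) and is not a method stated in this specification; the function names and sequences are hypothetical.

```python
_PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def wallace_tm(oligo: str) -> int:
    """Approximate melting temperature (deg C) of a short DNA oligonucleotide."""
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


def mismatch_count(antisense: str, sense: str) -> int:
    """Count positions where `antisense` (5' to 3') fails to pair with `sense` (5' to 3')."""
    if len(antisense) != len(sense):
        raise ValueError("sequences must have equal length")
    # The antisense strand pairs with the sense strand read in reverse.
    return sum(
        1
        for a, s in zip(antisense.upper(), reversed(sense.upper()))
        if _PAIR.get(s) != a
    )


sense = "ATGGCCATTG"        # hypothetical coding-strand segment
perfect = "CAATGGCCAT"      # its exact antisense
one_off = "CAATGCCCAT"      # same antisense with a single substitution
```

An antisense candidate with `mismatch_count(...) <= 1` satisfies the stated preference for no more than one mismatch; the Tm estimate gives only a first-pass indication of duplex stability.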

Antisense oligonucleotides can be modified without affecting their
ability to hybridize to a metastatic marker protein coding sequence. These
modifications can be internal or at one or both ends of the antisense molecule. For
example, internucleoside phosphate linkages can be modified by adding cholesteryl or
diamine moieties with varying numbers of carbon residues between the amino groups
and terminal ribose. Modified bases and/or sugars, such as arabinose instead of ribose,
or a 3', 5'-substituted oligonucleotide in which the 3' hydroxyl group or the 5' phosphate
group are substituted, can also be employed in a modified antisense oligonucleotide.
These modified oligonucleotides can be prepared by methods well known in the art.
Agrawal et al., 1992, Trends Biotechnol. 10:152-158; Uhlmann et al., 1990, Chem.
Rev. 90:543-584; Uhlmann et al., Tetrahedron Lett. 215:3539-3542.

Antibodies of the invention which specifically bind to a metastatic
marker protein can also be used to alter metastatic marker gene expression. By
antibodies is meant antibodies and parts or derivatives thereof, such as single chain
antibodies, that retain specific binding for the protein. Specific antibodies bind to
metastatic marker proteins and prevent the proteins from functioning in the cell.
Polynucleotides encoding specific antibodies of the invention can be introduced into
cells, as described above.

Marker proteins of the present invention can be used to screen for drugs
which have a therapeutic anti-metastatic effect. The effect of a test compound on
metastatic marker protein synthesis can also be used to identify test compounds which
modulate metastasis. Test compounds which can be screened include any substances,
whether natural products or synthetic, which can be administered to the subject.
Libraries or mixtures of compounds can be tested. The compounds or substances can
be those for which a pharmaceutical effect is previously known or unknown.

A cell is contacted with a test compound. The cell can be any cell, such
25' as a colon cancer : wYl/^iclli'^^My-i^^i^ the metastatic marker proteih being 
yi provide i'Sftjwbjii^'t^I 1 ^^ which can be 
' . used TcV screening assays. " •" 1 " "* s 

Synthesis of metastatic marker proteins can be measured by any means
for measuring protein synthesis known in the art, such as incorporation of labeled amino
acids into proteins and detection of labeled metastatic marker proteins in a
polyacrylamide gel. The amount of metastatic marker proteins can be detected, for
example, using metastatic marker protein-specific antibodies of the invention in
Western blots. The amount of the metastatic marker proteins synthesized in the
presence or absence of a test compound can be determined by any means known in the
art, such as comparison of the amount of metastatic marker protein synthesized with the
amount of the metastatic marker proteins present in a standard curve.
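
The comparison against a standard curve can be sketched as follows. This is a minimal illustration only, assuming a curve built from a few known protein amounts and their measured signals, with linear interpolation between neighboring standards. The function name, the standards, and the signal values are hypothetical and are not taken from the specification.

```python
from bisect import bisect_left


def amount_from_standard_curve(signal, standards):
    """Estimate a protein amount by linear interpolation on a standard curve.

    standards: (known_amount, measured_signal) pairs. The signal is assumed
    to increase strictly with the amount (an idealized, hypothetical curve).
    """
    points = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in points]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        # The signal coincides with one of the standards.
        return points[i][0]
    (a0, s0), (a1, s1) = points[i - 1], points[i]
    return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (ng of protein, band intensity in arbitrary units)
STANDARDS = [(0.0, 0.0), (10.0, 100.0), (20.0, 180.0), (40.0, 300.0)]

# Hypothetical band intensities measured with and without a test compound
with_compound = amount_from_standard_curve(140.0, STANDARDS)
without_compound = amount_from_standard_curve(240.0, STANDARDS)
```

Linear interpolation is only one possible choice; a fitted curve could be substituted where the assay response is known to be non-linear.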

The effect of a test compound on metastatic marker protein synthesis can
also be measured by Northern blot analysis, by measuring the amount of metastatic
marker protein mRNA expression in response to the test compound using metastatic
marker protein specific nucleotide probes of the invention, as is known in the art.

Typically, a biological sample is contacted with a range of concentrations
of the test compound, such as 1.0 nM, 5.0 nM, 10 nM, 50 nM, 100 nM, 500 nM, 1 mM,
10 mM, 50 mM, and 100 mM. Preferably, the test compound increases or decreases
expression of a metastatic marker protein by 60%, 75%, or 80%. More preferably, an
increase or decrease of 85%, 90%, 95%, or 98% is achieved.
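
The percentage figures above can be illustrated with a short calculation. The sketch below assumes that the change is expressed relative to an untreated control and that the lowest figure of each group (60% and 85%) is read as a cut-off; both readings are assumptions made for illustration. The readings and names are hypothetical.

```python
def percent_change(treated, control):
    """Signed percent change in expression relative to an untreated control."""
    if control <= 0:
        raise ValueError("control expression must be positive")
    return 100.0 * (treated - control) / control


def classify(change_pct):
    """Map the magnitude of a change onto the tiers named in the text.

    Treating 60% and 85% as cut-offs is an assumption of this sketch.
    """
    magnitude = abs(change_pct)
    if magnitude >= 85.0:
        return "more preferred"
    if magnitude >= 60.0:
        return "preferred"
    return "below threshold"


# Hypothetical readings: test compound concentration (nM) -> expression signal
CONTROL_SIGNAL = 200.0
READINGS = {1.0: 190.0, 10.0: 150.0, 100.0: 70.0, 500.0: 20.0}

results = {
    conc: classify(percent_change(signal, CONTROL_SIGNAL))
    for conc, signal in READINGS.items()
}
```

Because the magnitude of the change is used, the same tiers apply to compounds that increase expression and to compounds that decrease it.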

The invention provides compositions for increasing or decreasing
expression of metastatic marker protein. Therapeutic compositions for increasing
metastatic marker gene expression are desirable for markers which are down-regulated
in metastatic cells. These compositions comprise polynucleotides encoding all or at
least a portion of a metastatic marker protein gene expression product. Preferably, the
therapeutic composition contains an expression construct comprising a promoter and a
polynucleotide segment encoding at least a portion of the metastatic marker protein
which is effective to increase or decrease metastatic potential. Portions of metastatic
marker genes or proteins which are effective to decrease metastatic potential of a cell
can be determined, for example, by introducing various portions of metastatic marker
genes or polypeptides into metastatic cell lines, such as M
Km12C, or Km12L4, and assaying the division rate of the cells or the ability of the cells
to form metastases when implanted in vivo, as is known in the art. Non-metastatic cell
lines, such as MCF-7, can be used to assay the ability of a portion of a metastatic
marker protein to increase expression of a metastatic marker gene.

Within the expression construct, the polynucleotide segment is located
downstream from the promoter, and transcription of the polynucleotide segment
initiates at the promoter. A more complete description of gene transfer vectors,
especially retroviral vectors, is contained in U.S. Serial No. 08/869,309, which is
incorporated herein by reference.

Decreased metastatic marker gene expression is desired in conditions in
which the marker gene is up-regulated in metastatic cancer. Therapeutic compositions
for treating these disorders comprise a polynucleotide encoding a reagent which
specifically binds to a metastatic marker protein expression product, as disclosed herein.

Metastatic marker therapeutic compositions of the invention can
comprise a pharmaceutically acceptable carrier. Pharmaceutically acceptable carriers
are well known to those in the art. Such carriers include, but are not limited to, large,
slowly metabolized macromolecules, such as proteins, polysaccharides, polylactic acids,
polyglycolic acids, polymeric amino acids, amino acid copolymers, and inactive virus
particles. Pharmaceutically acceptable salts can also be used in the composition, for
example, mineral salts such as hydrochlorides, hydrobromides, phosphates, or sulfates,
as well as the salts of organic acids such as acetates, propionates, malonates, or
benzoates.

Therapeutic compositions can also contain liquids, such as water, saline,
glycerol, and ethanol, as well as substances such as wetting agents, emulsifying agents,
or pH buffering agents. Liposomes, such as those described in U.S. 5,422,156, WO
95/13796, WO 91/14445, or EP 524,968 B1, can also be used as a carrier for the
therapeutic composition.

Typically, a therapeutic metastatic marker composition is prepared as an
injectable, either as a liquid solution or suspension; however, solid forms suitable for
solution in, or suspension in, liquid vehicles prior to injection can also be prepared. A
metastatic marker composition can also be formulated into an enteric coated tablet or
gel capsule according to known methods in the art, such as those described in U.S.
4,853,230, EP 225,189, AU 9,224,296, and AU 9,230,801.

Administration of the metastatic marker therapeutic agents of the
invention can include local or systemic administration, including injection, oral
administration, particle gun, or catheterized administration, and topical administration.
Various methods can be used to administer a therapeutic metastatic marker composition
directly to a specific site in the body.

For treatment of tumors, including metastatic lesions, for example, a
therapeutic metastatic marker composition can be injected several times in several
different locations within the body of the tumor. Alternatively, arteries which serve a
tumor can be identified, and a therapeutic composition injected into such an artery, in
order to deliver the composition directly into the tumor.

A tumor which has a necrotic center can be aspirated and the
composition injected directly into the now empty center of the tumor. A therapeutic
metastatic marker composition can be directly administered to the surface of a tumor,
for example, by topical application of the composition. X-ray imaging can be used to
assist in certain of the above delivery methods. Combination therapeutic agents,
including a metastatic marker protein or polypeptide, or a metastatic marker
subgenomic polynucleotide and other therapeutic agents, can be administered
simultaneously or sequentially.

Receptor-mediated targeted delivery can be used to deliver therapeutic
compositions containing metastatic marker subgenomic polynucleotides, proteins, or
reagents such as antibodies, ribozymes, or antisense oligonucleotides to specific tissues.

:; ;;.Rcxeptorrmedjated ; defivery Findeis et at. 

. (1993), liw&W BiotechnoL Jl^^^ C gene, th^apeutics: 

. : u METHOl^ AND APPLI^ATldNS OF DIRECT GENE ^I^FER (J:A. Wo|ff ? ed.); f Wu & Wu 

25 ( 1 988), J. Biol gtem : J63,fa\^\Wxi r et.al:,(i994) r? A. BipL. Chem u 269, 542^; f Zenke : 
, eV^ Chem. 

Alternatively, a metastatic marker therapeutic composition can be
introduced into human cells ex vivo, and the cells then replaced into the human. Cells
can be removed from a variety of locations including, for example, from a selected
tumor or from an affected organ. In addition, a therapeutic composition can be inserted
into non-affected cells, for example, dermal fibroblasts or peripheral blood leukocytes. If
desired, particular fractions of cells such as a T cell subset or stem cells can also be
specifically removed from the blood (see, for example, PCT WO 91/16116). The
removed cells can then be contacted with a metastatic marker therapeutic composition
utilizing any of the above-described techniques, followed by the return of the cells to
the human, preferably to or within the vicinity of a tumor or other site to be treated.
The methods described above can additionally comprise the steps of depleting
fibroblasts or other non-contaminating tumor cells subsequent to removing tumor cells
from a human, and/or the step of inactivating the cells.

Both the dose of a metastatic marker composition and the means of
administration can be determined based on the specific qualities of the therapeutic
composition, the condition, age, and weight of the patient, the progression of the
disease, and other relevant factors. Preferably, a therapeutic composition of the
invention increases or decreases expression of the metastatic marker genes by 50%,
60%, 70%, or 80%. Most preferably, expression of the metastatic marker genes is
increased or decreased by 90%, 95%, 99%, or 100%. The effectiveness of the
mechanism chosen to alter expression of the metastatic marker genes can be assessed
using methods well known in the art, such as hybridization of nucleotide probes to
mRNA of the metastatic marker genes, quantitative RT-PCR, or detection of the
metastatic marker proteins using specific antibodies of the invention.

If the composition contains the metastatic marker proteins, polypeptide,
or antibody, effective dosages of the composition are in the range of about 5 µg to about
50 µg/kg of patient body weight, about 50 µg to about 5 mg/kg, about 100 µg to about
500 µg/kg of patient body weight, and about 200 to about 250 µg/kg.

Therapeutic compositions containing metastatic marker subgenomic
polynucleotides can be administered in a range of about 100 ng to about 200 mg of
DNA for local administration. Concentration ranges of about 500 ng to about 50 mg
and about 1 µg to about 2 mg of DNA can also be used during a gene therapy protocol.
Factors such as method of action and efficacy of transformation and expression are
considerations that will affect the dosage required for ultimate efficacy of the
metastatic marker subgenomic polynucleotides. Where greater expression is desired
over a larger area of tissue, larger amounts of metastatic marker subgenomic
polynucleotides or the same amounts administered in a successive protocol of
administrations, or several administrations to different adjacent or close tissue portions
of, for example, a tumor site, can be required to effect a positive therapeutic outcome.
In all cases, routine experimentation in clinical trials will determine specific ranges for
optimal therapeutic effect.
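
The per-kilogram ranges stated for protein, polypeptide, or antibody compositions scale linearly with body weight. The sketch below shows only that unit arithmetic, assuming the four ranges are read as about 5 to 50, 50 to 5000, 100 to 500, and 200 to 250 µg per kg. The 70 kg weight and all names are hypothetical, and the result is an illustration of the conversion rather than a dosing recommendation.

```python
# Per-kilogram ranges as read from the text, in micrograms per kg of body weight.
PROTEIN_RANGES_UG_PER_KG = [
    (5.0, 50.0),
    (50.0, 5000.0),  # "about 50 µg to about 5 mg/kg"
    (100.0, 500.0),
    (200.0, 250.0),
]


def total_amount_range_ug(weight_kg, per_kg_range):
    """Scale a (low, high) µg/kg range to total micrograms for one body weight."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = per_kg_range
    return (low * weight_kg, high * weight_kg)


# Hypothetical body weight, chosen only to show the arithmetic
EXAMPLE_WEIGHT_KG = 70.0
totals = [
    total_amount_range_ug(EXAMPLE_WEIGHT_KG, r) for r in PROTEIN_RANGES_UG_PER_KG
]
```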

Expression of an endogenous metastatic marker gene in a cell can also be
altered by introducing in frame with the endogenous metastatic marker gene a DNA
construct comprising a metastatic marker protein targeting sequence, a regulatory
sequence, an exon, and an unpaired splice donor site by homologous recombination,
such that a homologously recombinant cell comprising the DNA construct is formed.
The new transcription unit can be used to turn the metastatic marker gene on or off as
desired. This method of affecting endogenous gene expression is taught in U.S. Patent
No. 5,641,670, which is incorporated herein by reference.

The targeting sequence is a segment of at least 10, 12, 15, 20, or 50
contiguous nucleotides selected from the nucleotide sequences shown in SEQ ID
NOS:1-85 or the complements thereof. The transcription unit is located upstream of a
coding sequence of the endogenous metastatic marker protein gene. The exogenous
regulatory sequence directs transcription of the coding sequence of the metastatic
marker genes.
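
The length and contiguity requirement for a targeting sequence can be checked mechanically. The sketch below assumes that "complement" is read as the reverse complement of the reference strand, and it uses a made-up reference sequence rather than any of the SEQ ID NOS; the function names are likewise hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_valid_targeting_sequence(candidate, reference, min_length=10):
    """True if candidate is a contiguous segment of at least min_length
    nucleotides of the reference or of its reverse complement."""
    cand = candidate.upper()
    ref = reference.upper()
    if len(cand) < min_length:
        return False
    return cand in ref or cand in reverse_complement(ref)


# Made-up reference sequence, for illustration only
REFERENCE = "ATGGCCTTAGACCGTTACGGATCCAAGT"

sense_segment = REFERENCE[5:20]                          # 15 contiguous nucleotides
antisense_segment = reverse_complement(REFERENCE[3:15])  # 12 nt from the complement
```

The `min_length` argument corresponds to the alternative minimum lengths recited above (10, 12, 15, 20, or 50 nucleotides).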

A metastatic marker subgenomic polynucleotide can also be delivered to
subjects for the purpose of screening test compounds for those which are useful for
enhancing transfer of metastatic marker subgenomic polynucleotides to the cell or for
enhancing subsequent biological effects of metastatic marker subgenomic
polynucleotides within the cell. Such biological effects include hybridization to
complementary metastatic marker mRNA and inhibition of its translation, expression of
a metastatic marker subgenomic polynucleotide to form metastatic marker mRNA
and/or metastatic marker protein, and replication and integration of a metastatic marker
subgenomic polynucleotide. The subject can be a cell culture or an animal, preferably a
mammal, more preferably a human.

Test compounds which can be screened include any substances, whether
natural products or synthetic, which can be administered to the subject. Libraries or
mixtures of compounds can be tested. The compounds or substances can be those for
which a pharmaceutical effect is previously known or unknown. The compounds or
substances can be delivered before, after, or concomitantly with a metastatic marker
subgenomic polynucleotide. They can be administered separately or in admixture with
a metastatic marker subgenomic polynucleotide.
Integration of a delivered metastatic marker subgenomic polynucleotide
can be monitored by any means known in the art. For example, Southern blotting of the
delivered metastatic marker subgenomic polynucleotide can be performed. A change in
the size of the fragments of a delivered polynucleotide indicates integration.
Replication of a delivered polynucleotide can be monitored inter alia by detecting
incorporation of labeled nucleotides combined with hybridization to a metastatic marker
probe. Expression of metastatic marker subgenomic polynucleotide can be monitored
by detecting production of metastatic marker mRNA which hybridizes to the delivered
polynucleotide or by detecting metastatic marker protein. Metastatic marker protein can
be detected immunologically. Thus, the delivery of metastatic marker subgenomic
polynucleotides according to the present invention provides an excellent system for
screening test compounds for their ability to enhance transfer of metastatic marker
subgenomic polynucleotides to a cell, by enhancing delivery, integration, hybridization,
expression, replication or integration in a cell in vitro or in an animal, preferably a
mammal, more preferably a human.
The above disclosure generally describes the present invention. A more
complete understanding can be obtained by reference to the following specific Examples
which are provided herein for purposes of illustration only, and are not intended to
limit the scope of the invention.

EXAMPLE 1
Differentially Expressed Genes

This example demonstrates polynucleotides that are differentially
expressed in human breast or colon cancer cell lines.

Human cell lines used to identify differentially expressed
polynucleotides are the human breast cancer cell lines MCF-7 (non-metastatic), MDA-
MB-231 (metastatic to bone and/or lung), and MDA-MB-435 (metastatic to lung)
(Brinkley and Cailleau, 1980, Cancer Res. 40:3118), and the colon cancer cell lines
Km12C (low metastatic) and Km12L4A (highly metastatic) (Morikawa et al., 1988,
Cancer Res. 48:1943-1948).

RNA was prepared from each cell line and reverse transcribed to form
cDNA. The cDNA was amplified using random primers. Amplification products were
visualized on a sequencing gel, and cDNA corresponding to mRNA which was
differentially expressed in the cell lines was identified.

Expression patterns and sequence identification numbers of novel
metastatic marker polynucleotides are shown in Table 1.

Expression patterns and sequence identification numbers of metastatic
marker polynucleotides which correspond to known genes are shown in Table 2, and the
corresponding proteins are described below.

Osteopontin (SEQ ID NO:64) (OPN or Spp1 for secreted phosphoprotein
1) is a secreted extracellular matrix protein, often expressed during wound healing,
involved in osteoclastic differentiation and activation, as described in Heymann et al.,
1998, Cytokine 10:155-168. Osteopontin is found in bone and other epithelial cells, and
has been shown to stimulate proliferation of a quiescent subpopulation of prostate
epithelial cells (see Elgavish et al., 1998, Prostate 35:83-94).

Osteopontin is implicated during the development of diabetic
nephropathy (Fischer et al., 1998, Diabetes 47:1512-1518); the process of cartilage-to-
bone transition during rigid bone healing after bone fracture (Nakase et al., 1998, Acta
Histochem 100:287-295); wound healing by an interaction with the receptor integrin
alpha(v)beta 3 after focal stroke (Ellison et al., 1998, Stroke 29:1698-1706); integrin
receptor binding and signaling during cell attachment and mechanical stimulation of
osteoblasts (Carvalho et al., 1998, J. Cell Biochem 70:376-390); kidney morphogenesis
(Denda et al., 1998, Mol. Biol. Cell 9:1425-1435); and as an interstitial chemoattractant
in renal inflammation (Rovin and Phan, 1998, Am. J. Kidney Dis. 31:1065-1084). Mice
lacking the osteopontin gene showed modulation in osteoclast differentiation from wild
type mice (see Rittling et al., 1998, J. Bone Miner. Res. 13:1101-1111).

sites, and can promote leukocyte adhesion through the alpha4beta1 integrin, as
described in Bayless et al., 1998, J. Cell Sci. 111:1165-1174. Osteopontin is
transcriptionally regulated by retinoic acid (see Manji et al., 1998, J. Cell Physiol.
176:1-9); preferentially expressed in high grade metastatic brain tumors compared to
low grade brain tumors, and inducible by tissue plasminogen activator (tPA) in glioma
cell lines (see Tucker et al., 1998, Anticancer Res. 18:807-812). Osteopontin is
expressed in about 73% of primary gastric carcinoma tissues and correlated with the
progression of human gastric carcinoma and lymphogenous metastasis (see Ue et al.,
1998, Int. J. Cancer 79:127-132).

Nip (SEQ ID NO:65) is described in Boyd et al., 1994, Cell 79:341-351.
Adenovirus E1B 19 kDa protein protects against cell death induced by viral infection
and external stimuli, and can be functionally substituted with the Bcl-2 protooncogene.
E1B 19 kDa interacting proteins (Nip1, Nip2, and Nip3) were discovered in yeast two-
hybrid studies conducted to discern proteins that interact with 19 kDa protein, as
described by Boyd et al., supra. Nip 1, 2, and 3 interact with discrete domains of E1B
19 kDa, and similarly also interact with Bcl-2, in both cases promoting cell survival.
Ca+2-dependent protease (SEQ ID NO:66) is Ca+2-dependent protease (also
called calpain), activity of which is present in every vertebrate cell that has been
examined. Ca+2-dependent protease activity is associated with cleavages that alter
regulation of various enzyme activities, with remodeling or disassembly of the cell
cytoskeleton, and with cleavages of hormone receptors (see Goll et al., 1992, Bioessays
14(8):549-556). Ca+2-dependent protease activity is regulated by binding of Ca+2 to
specific sites on the calpain molecule, with binding to each site generating a specific
response correlated with a specific activity (e.g., proteolytic activity, calpastatin binding,
etc.), as described in Goll et al. Excessive activation of the Ca+2-dependent protease
calpain may play a role in the pathology of disorders including cerebral ischemia,
cataract, myocardial ischemia, muscular dystrophy, and platelet aggregation.
Therapeutic applications include selective Ca+2-dependent protease inhibition, as
described in Wang and Yuen, 1994, Trends Pharmacol. Sci.

IGF-R (insulin-like growth factor receptor) (SEQ ID NO:67) is a
transmembrane tyrosine kinase linked to the ras-raf-MAPK (mitogen-activated protein
kinase) cascade and required for the cell to progress through the cell cycle (Werner and
Roith, 1997, Crit. Rev. Oncog. 8(1):71-92). IGF-R mediates mitogenesis, growth
hormone action, cell survival and transformation to and maintenance of the malignant
phenotype. IGF-R is a member of the growth factor receptor tyrosine kinase
superfamily, exists as covalent cross-linked dimers where each monomer is composed
of two subunits, and is bound by ligand in the extracellular domain (McInnes and
Sykes, 1997, Biopolymers 43(5):339-366).

The domains of the IGF-R are described in Sepp-Lorenzino, 1998,
Breast Cancer Res Treat 47(3):235-253, including domains responsible for mitogenesis,
transformation, and protection from apoptosis. IGF-R expression is increased in breast
cancer cells derived from tumor tissue and cell lines, as described in Surmacz et al.




tumor mass and/or aid tumor recurrence by promoting proliferation, cell survival, and
cell-cell interactions. Human pancreatic cancers overexpress IGF-R and its ligand
(Korc, 1998, Surg Oncol Clin N Am 7(1):25-41), and expression of IGF-1 and IGF-R is
determined to be a prognostic factor (reflecting the interaction between the neoplastic
cells and their microenvironment) for lymphocytic infiltration in thyroid carcinomas
(Fonsepe/^ 

IGF-BP5 (SEQ ID NO:68) is insulin-like growth factor binding protein
5, described in Allander et al., 1994, J. Biol. Chem. 269:10891-10898. The gene and
promoter for IGF-BP5 are characterized by Allander et al., 1994, J. Biol. Chem.
269:10891-10898, and general actions of IGF-BPs are described in Chan and
Spencer, 1997, Endocrine 7:95-97. Potential impact of IGF-BPs on cancer cell growth
is described in Oh, 1997, Endocrine 7:111-113, and Oh, 1998, Breast Cancer Res Treat
47:283-293. IGF-BP5 is expressed during brain development: IGF-BP5 and IGF-1
mRNAs are synchronously coexpressed in principal neurons of sensory relay systems,
including the olfactory bulb, medial and dorsal lateral geniculate bodies, and ventral
tier, cochlear, lemniscal, and vestibular nuclei, and are transiently coexpressed in
principal neurons of the anterodorsal nucleus, as described in Bondy and Lee, 1993, J.
Neurosci 13(12):5092-5104. IGF-BP5 is expressed by luminal or cumulus granulosa
cells in virtually all follicles, and is highly abundant in stromal interstitial cells of the
mature ovary (see Zhou and Bondy, 1993, Biol. Reprod 48:467-482). IGF-BP5
induction is strongly stimulated during differentiation of skeletal myoblasts and is
correlated with IGF-R activation as described in Rousse et al., 1998, Endocrinology
139:1487-1493. IGF-BP5 and other components of the IGF system are critical in
postnatal brain development (see Lee et al., 1996, J Cereb Blood Flow Metab 16:227-
236).

IGF-BP5 stimulates bone cell proliferation by an IGF-independent
mech^ surface binding sites, as described in 

Mohan^^ «wihebtive tissue cell types, 

IGF-BP5 has a lowered binding affinity to the extracellular matrix which allows IGF-I
to be^ ih turii poteniia^ IGF-I a&ion on 

fibroblasts and <m<^ 

Lactate dehydrogenase (SEQ ID NO:69) is a member of the LDH group
of tetrameric enzymes with five isoforms composed of combinations of two subunits,
LDH-A and LDH-B. Shim et al., 1997, Proc. Natl. Acad. Sci., described
the relationship between LDH-A and neoplasia. In particular, overexpression of LDH-
A may contribute to altered metabolism that confers neoplastic growth advantage. The
expression pattern of LDH in the present invention is consistent, in that LDH
expression is higher in two metastatic lines than in a non-metastatic
breast cancer cell line (Table 2).
in tumor tissue and cells and is associated with poor survival rate in small cell lung
carcinoma (SCLC), as described in Ray et al., 1998, Cancer Detect Prev 22:293-304,
making it a useful prognostic indicator for SCLC as discussed in Stokkel et al., 1998, J.

Ufo TKR (SEQ ID NO:70) is described in Schulz et al., 1993, Oncogene
8:509-513. This protein has been reported as a marker in tumors, but has not previously
been reported in breast cancer. According to the present invention, expression is found
in the MDA-MB-231 breast cancer cell line, but not in the MCF-7 or MDA-MB-435
cell lines. This gene and protein provide new markers for distinguishing breast cancer
tissue of different types of metastatic potential.

Initially isolated from primary human myeloid leukemia cells, the ufo
oncogene (also called Axl or Ark) is a receptor tyrosine kinase (RTK). Its genomic
structure is described in Schulz et al., supra, and its differential expression is described
in Challier et al., 1996, Leukemia 10:781-787. The ufo protein is a member of a class
of RTKs having two fibronectin type III domains and two immunoglobulin-like
domains present in the extracellular portion, and is preferentially expressed in
monocytes, stromal cells, and some CD34-positive progenitor cells (Neubauer et al.,
1997, Leuk Lymphoma 25:91-96). Ufo has an extracellular structure similar to neural
cell adhesion molecules, and has direct or indirect binding sites for PLCgamma, GRB2,
c-src, and lck (Braunger et al., 1997, Oncogene 14:2619-2631).

eIF-2 (SEQ ID NO:71) is a translation initiation factor, and functions as
a heterotrimeric GTP-binding protein involved in the recruitment of methionyl-tRNA to
the 40 S ribosomal subunit (Gaspar et al., 1994, J. Biol. Chem. 269:3415-3422).
According to the present invention, higher expression is found in two metastatic breast
cancer cell lines, and not in cell line MCF-7.

eIF-2 is involved in introducing the initiator tRNA into the translation
mechanism and performing the first step in the peptide chain elongation cycle. eIF-2 is
associated with a 5 subunit molecule having GTP recycling function called eIF-2B
(Kyrpides and Woese, 1998, Proc. Natl. Acad. Sci. USA 95:3726-3730, and Kimball et
al., 1998, J. Biol. Chem. 273:12841-12845).

eIF-2 has subunits alpha and beta. eIF-2alpha is phosphorylated at Ser
51 and then modulates the interaction of eIF-2 and eIF-2B, as described in Kimball et
al., 1998, Protein Expr. Purif. 12:415-419, Kimball et al., 1998, J. Biol. Chem.
273:3039-3044, and Pavitt et al., 1998, Genes Dev. 12:514-526. It is reported that by
regulating translation initiation, control of cell growth and division in eukaryotic cells is
achieved: for example, clotrimazole, a potent anti-proliferative agent in vitro and in
vivo, depleted intracellular Ca+2 stores, which activates PKR, resulting in the
phosphorylation of eIF-2alpha, and the ultimate inhibition of protein synthesis and
blockage of the cell cycle in G1 phase (Aktas et al., 1998, Proc. Natl. Acad. Sci. USA
95:8280-8285). Additionally, Kim et al., 1998, Mol. Med. 4:179-190, show that nitric
oxide (NO) suppresses protein synthesis in cell types including human ovarian tumor
cells by stimulating phosphorylation of eIF-2alpha.

Glutaminyl cyclase (SEQ ID NO:72) is described by Song et al., 1994,
J. Mol. Endocrinol. 13:77-86, and is expressed most highly in the most metastatic cell
line MDA-MB-435, as compared to less metastatic line MDA-MB-231 and non-
metastatic line MCF-7. Glutaminyl cyclase (also called glutamine cyclotransferase)
converts glutaminyl-peptides (such as gonadotropin-releasing hormone and thyrotropin-
releasing hormone) into pyroglutamyl-peptides, as described in Busby et al., 1987, J.
Biol. Chem. 262:8532-8536, Fischer and Spiess, 1987, Proc. Nat'l Acad. Sci. USA
84:3628-3632, and Pohl et al., 1991, Proc. Nat'l Acad. Sci. 88:10059-10063. Cloning
and sequence analysis of glutaminyl cyclase derived from a human pituitary cDNA
library is described in Song et al., 1994, J. Mol. Endocrinol. 13:77-86. Studies on the
catalytic pathway of glutaminyl cyclase and its substrate specificity are described in
Gololobov et al., 1996, Biol. Chem. Hoppe Seyler 377:395-398. Assays for the
presence of glutaminyl cyclase activity are described in Koger et al., 1989, Method
Enzymol 168:358-365 and Houseknecht et al., 1998, Biotechniques 24:346.

gp130 (SEQ ID NO:73) is transmembrane protein glycoprotein 130.
gp130 is a signal transducing shared component of the receptor complexes for the
interleukin-6 (IL-6)-type cytokines (Hirano et al., 1997, Cytokine Growth Factor Rev.
8:241-252), including IL-6, IL-11, leukemia inhibitor factor (LIF), oncostatin M
(OSM), ciliary neurotrophic factor and cardiotrophin-1. The N-terminal of gp130 is an
extracellular immunoglobulin-like portion of the protein (Hammacher et al., 1998, J.
fl/oL^ transduction including gp| 30 occurs through 

the gp130/Jak/STAT pathway (Heinrich et al., 1998, Biochem. J. 334:297-314). The
cytokines acting through the pathway that includes gp130 (also called gp130 cytokines)
exhibit pleiotropic biological activities including immune, hemopoietic, and neural
effects (Nakashima and Taga, 1998, Semin Hematol. 35:210-221, Thompson et al.,
1998, Neuroscience 84:l247rl25^ Rev. Immunol ^ 76:249-284, Marz 

M al;, l997„\Eur. J. NeyrpscL 1998, //i/ Immunol 

gp130 cytokines are reported to control survival and proliferation of
myeloma cell lines and primary myeloma cells (Klein, 1998, Curr. Opin. Hematol.
5:186-191). gp130 is expressed in renal cell carcinomas and has an
important role in the proliferation of some renal cell carcinoma cell lines (Costes et al.,

15 ,1997^ C//n.^ , , r > 
E^cadherin (SEQ ID NQ:7$) is a member of a family of glycoproteins 
responsible for calcium-dependent cell-cell adhesion and is implicated in maintaining
cytoskeletal integrity. Epithelial cadherin-mediated cell adhesion system
in cancer cells is inactivated by multiple mechanisms corresponding to the pathological
features of the particular tumor type (Hirohashi, 1998, Am J Pathol 153:333-339). In
general the E-cadherin protein mediates Ca+2-dependent cell-cell adhesion.
Transcriptional inactivation of E-cadherin expression occurs frequently in tumor
progression, and thus inactivation or downregulation of E-cadherin plays a significant
role in multistage carcinogenesis (Hirohashi, 1998, Am J Pathol 153:333-339).

E-cadherin is characterized as a tumor suppressor of the metastatic
phenotype, as described in MacGrogan and Bookstein, 1997, Semin Cancer Biol 8:11-
19, and cadherins are important determinants of tissue morphology including invasive
carcinoma as described in van der Linden, 1996, Early Pregnancy 2:5-14, and Yap,
1998, Cancer Invest. 16:252-261.

Mechanisms of action of cadherins are discussed in Daniel and
Reynolds, 1997, Bioessays 19:883-891.

mdiecules including E-cadHerin are described iri Joseph-SilVerstein and SilVerstein, 
mpl Cancer Invest 76:176-182, Yape/ a/., 1997. Anna: Rev. Cell Dev. Biol: IM\9- 

5 H^W benium, 1998, Ce// 9i:1095-l0#: Cell adhesion tiiofoules ^ 
cadhenn are potential targets for aiiti-^oer drugs aihd thdaj^ut^ 
chronic inflammatory disease as described ' iri Buckley l^d^ 
^d^ i:449-456, Moll and Moll, 1998^ Vir chaws Arch V52:487-504/ ? 

According to the present invention, E-cadherin is expressed in non-
metastatic breast cancer cell line MCF-7, and not in MDA-MB-231 and MDA-MB-435.
The expression products are diagnostic markers indicating the metastatic potential of
breast cancer tissue samples.

Serpin (SEQ ID NO:76). Serine protease inhibitors (serpins) are a family of
protease inhibitors that inhibit chymotrypsin-like serine proteases (Whisstock et al.,
1998, Trends Biochem. Sci. 23:63-67) and that have the unique ability to regulate their
activity by changing the conformation of their reactive-center loop; studies of serpin
variants provide definition for the functional domains of serpins that control the folding
and link serpin mutations to disease (see Stein and Carrell, 1995, Nat. Struct. Biol.
2:96-113). Serine protease cleavage of proteins is essential to a wide variety of
biological processes, and the cleavage is primarily regulated by the cleavage inhibitors,
• ^d^nbe<i in 199<£ 5^ 

included 

plasmin (PAI-1 and PAI-2) (Andreasen et al., 1997, Int. J. Cancer 72:1-22),
thrombin, urokinase plasminogen activator, and kallikrein (Turgeon and Houenou,
1997, Brain Res Brain Res Rev 25:85-95). Some serpins also have other activities
including neuronal differentiation and survival (... Biol. 425:...) and tumor
suppression (Sager et al., 1997, Adv Exp Med Biol
425:77-88). PAI-1 and PAI-2 are linked to cancer metastasis, as described in
Andreasen et al., 1997, Int. J. Cancer 72:1-22.
pS2 (SEQ ID NO:77) was isolated from MCF7 human breast cancer
cells, as described in Takahashi et al., 1990, FEBS Lett. ... pS2 is
estrogen-regulated ... reported that
the pS2 status declined from ... pS2
expression also is associated with a good prognosis in breast cancer patients. According
to the present invention, pS2 is expressed in MCF-7 cells, but not in two metastatic
breast cancer cell lines.

pS2 (presenilin-2 or trefoil factor 1 (TFF1)) is a trefoil polypeptide
normally expressed in the mucosa of the gastrointestinal tract, and found ectopically in
gastrointestinal inflammatory disorders and various carcinomas (...
1997, J. Pathol. 183:4-7). pS2 is expressed in breast cancer (Poulsom et al., 1997, J.
Pathol. 183:30-38). pS2 is a pleiotropic factor involved in mucin polymerization, cell
motility (... 1997, J. Clin. Gastroenterol. 25(1):S94-S100), cell
proliferation and/or differentiation, and possibly in the nervous system (see Ribieras et
al., ...).

LIV-1 (SEQ ID NO:78) is an estrogen-regulated protein reported in the
... (Green et al., GenBank Accession No. U41060).

According to the present invention, LIV-1 is expressed in MCF-7 cells, but not in two
metastatic breast cancer cell lines.

20,. - ^ 

family (LIV-2, 3, and 4) are binding proteins that represent a transport system for
branched chain amino acids in E. coli, as described in Yamamoto et al., 1979, J.
Bacteriol. 138:24-32, and Yamamoto and Anraku, 1980, J. Bacteriol. 144:36-44. A
human homologue to LIV-1 is both estrogen and growth factor inducible in the MCF-7
human breast cancer cell line (El-Tanani and Green, 1997, J. Steroid Biochem. Mol.
Biol. 60:269-276; El-Tanani and Green, 1996, Mol. Cell. Endocrinol. ...; and El-

GTP-binding protein (SEQ ID NO:79) is a member of the family of
guanine nucleotide-binding regulatory proteins, G proteins. The protein is expressed in
MCF-7 cells, but not in two metastatic breast cancer cell lines.
G proteins provide signaling mechanisms for the serpentine family of
receptors as described in Dhanasekaran and Prasad, 1998, Biol. Signals Recept. 7:109-117.
Studies indicate that the alpha as well as the beta gamma subunits of the GTP-binding
proteins are involved in the regulation of several cellular responses, some of
which are critical to the regulation of cell growth and differentiation
(Dhanasekaran and Prasad, 1998, Biol. Signals Recept. 7:109-117). G protein coupled
receptors regulate the mitogen activated protein kinase pathway as described in Russell
and Hoeffler, 1996, J. Invest. Dermatol. Symp. Proc. 1:119-122, and thus play a role in
controlling cell cycle. GTP binding proteins are also implicated in the regulation of

10 intracellular t^^ 

Chemokines induce various intracellular signaling pathways in natural
killer cells by activating members of GTP binding proteins as described in Maghazachi
and Al-Aoukaty, 1998, FASEB J. 12:913-924. Heterotrimeric GTP binding proteins
regulate distinct signaling pathways, some of which in turn regulate the activity of
Na+/H+ exchanger proteins as described in Voyno-Yasenetskaya, 1998, Biol. Signals
Recept. 7:118-124.

Desmoplakin (SEQ ID NO:84) is a member of a family of proteins that
serve as cell surface attachment sites for cytoplasmic intermediate filaments.

Vimentin (SEQ ID NO:80) is a member of the intermediate filament
gene family (Evans, 1998, Bioessays 20:79-86). Intermediate filaments are a major
component of the cytoskeleton of higher eukaryotes. Vimentin gene knockout mice
indicate degeneration of the cerebellar Purkinje cells (Galou et al., 1997, Biol Cell
89:85-97). Vimentin is positive in immunohistochemical reactions of sarcomas and
related lesions (Gaudin et al., 1998, Am J Surg Pathol 22:148-162), and biphasic
desmoplastic small round cell tumors and their variants (Gerald et al., 1998, J. Clin.
Oncol. 16:3028-3036). Vimentin is also expressed in neoplasms ...
differentiation as described in Perez-Ordonez and Rosai, 1998, Semin. Diagn. Pathol.
15:144-154, and in biphasic ... mixed mullerian
tumors as described in Guarino et al., 1998, Tumori 84:391-397.
Cytochrome C Oxidase (CcO) (SEQ ID NO:81) is the terminal enzyme
of the respiratory chain of mitochondria and aerobic bacteria: it catalyzes electron
transfer from cytochrome C to molecular oxygen, reducing the oxygen to water (Michel
et al., 1998, Annu Rev Biophys Biomol Struct 27:329-356). Cytochrome C oxidase is a
member of the superfamily of quinol and cytochrome C oxidase complexes that are
related by a homologous subunit containing six positionally conserved histidines that
ligate a low-spin heme and a heme-copper dioxygen activating and reduction center as
described in Musser and Chan, 1998, J. Mol. Evol. 46:508-520. Cytochrome C and
ubiquinol oxidases are membrane-bound redox-driven proton pumps which couple an
electron current to a proton current across the membrane (see Karpefors et al., 1998,
Biochim Biophys Acta 1365:159-169). Analysis of mutant forms of cytochrome C
oxidase is described in Mills and Ferguson-Miller, 1998, Biochim Biophys Acta
1365:46-52. Nitric oxide inhibits respiration at cytochrome C oxidase, as described in
Torres et al., 1998, J. Bioenerg Biomembr 30:63-69.

Heat shock protein 90 (hsp90) (SEQ ID NO:82) acts as a chaperone
molecule in association with the glucocorticoid and progesterone nuclear receptors, and
has A, B, and Z regions for facilitating these interactions (Dao-Phan et al., 1997, Mol
Endocrinol 11:962-972). Levels of hsp90 are reported elevated in active systemic lupus
erythematosus (Stephanou et al., 1997, Biochem J 321:103-106). Increased hsp90
expression is implicated in regulation of forms of cell injury that lead to programmed
cell death as described in Galea-Lauri et al., 1996, J. Immunol. 157:4109-4118. Hsp90
is upregulated in regenerating fibers and diseased fibers of Duchenne muscular
dystrophy (Bornman et al., 1996, Muscle Nerve 19:574-580), and is a candidate
substrate for proteolysis during ionizing radiation-induced apoptosis of some breast
cancer cells (Prasad et al., 1998, Int. J. Oncol. 13:757-764). Hsp90 is involved in
dislocation of the mutant insulin receptors from the endoplasmic reticulum to the
cytosol as described in Imamura et al., 1998, J. Biol. Chem. 273:11183-11188, and
associates with and activates endothelial nitric oxide synthase as described in Garcia-
Cardena et al., Nature 392:821-824.
Integrin alpha 6 (SEQ ID NO:83) is in the family of integrins,
heterodimeric, cation dependent cell membrane adhesion molecules that mediate cell-
cell and cell-matrix interactions. Integrin alpha 6 is a component of the
hemidesmosome complex (Jones et al., 1998, Bioessays 20:488-494). Integrins
maintain tissue integrity and regulate cell proliferation, growth, differentiation, and
migration (see Thomas et al., 1997, Oral Oncol. 33:381-388). In oral squamous cell
carcinomas there is a variable loss or reduced expression of integrin alpha 6, as
described in Thomas et al., 1997, Oral Oncol. 33:381-388. Alpha 6 integrin also plays
an active role in invasion of intestinal and diffuse-type cells of representative gastric
carcinoma cell lines as described in Koike et al., 1997, J. Cancer Res. Clin. Oncol.
123:310-316.

Osteogenic protein-1 (OP-1) (also called BMP-7) (SEQ ID NO:85) is a
morphogenetic factor (and a member of the bone morphogenetic protein (BMP) family
of growth factors) and is highly expressed in kidney and involved in tissue repair and
development (see Almanzar et al., 1998, J. Am. Soc. Nephrol. 9:1456-1463). OP-1 is
also expressed in the developing nervous system and can induce dendritic growth in
sympathetic neurons as described in Guo et al., 1998, Neurosci. Lett. 245:131-134.
OP-1 stimulates cartilage formation as described in Klein-Nulend et al., 1998, J. Biomed.
Mater. Res. 40:614-620.

OP-1 induces down-regulation of insulin-like growth factor binding
proteins (particularly IGFBP-5) thus affecting IGF-1 in the context of bone cell
differentiation and mineralized bone nodule formation as described in Yeh et al., 1997,
Endocrinology 138:4181-4190. OP-1 can be used as a bone graft substitute to promote
spinal fusion and to aid in the incorporation of metal implants (Cook and Rueger, 1996,
Clin. Orthop. 324:29-38). The three dimensional structure of OP-1 is reported in
Griffith et al., 1996, Proc Natl Acad Sci 93.

The protein encoded by SEQ ID NO:56 is a putative secreted protein and
is highly expressed in fat tissue.
Table 1. Novel Differentially Expressed Metastatic Marker Polynucleotides

Columns: TRANSCRIPT NUMBER | SEQ ID NO: | non-metastatic breast MCF-7 | breast cancer metastatic to bone and/or lung MDA-MB-231 | breast cancer metastatic to lung MDA-MB-435 | low metastatic from colon KM12C | high metastatic from colon KM12L4A

9Qi ; 


1 




+ \ 








907 

t 












• .-. ? 


9102b ; 


. ... 3. 




* 




. ■ . _ . 




-9114 i 


.•4 






+ . 






9121a 


... > 




j. 


- 


. . . \ 




9129 




+; 


- . 


+ 


■ - • 


i. ■ ■ . 


9139a 




+ 






... 


... ..... 


3 

9143b ■ 


8 






- 


i 


,>' " 


9157b J 


9 




■ '■ ■ 


+ 


I 




9166 


10 






- 






9170b ! 


11 








. ... 




9190a ; 


12 


+ 










919! 1 


13 












9216 


1 A 

14 




Ti.. . 


+ i 






9224c . 


15 


+ 










9230b . 


i ^. 
16 


+ 










.924 


ill 


+ 






• i 




9242a . 


18 




1 






•■ ■-■ 


OO <Oo 1 


:iv 










: i 


9261 : 


]20 








i 




9272 ! 


21 ' 












9293b 


■22 ••; 




+ 




• I"' 




9304b : 


23 


+ 










9307a 


.24 












931 


^25 


+ 


1 








9313 


J26 

9316 


27 


+ 










9318b 


28 













9320a i 


29 












9330b 


30 












9335 


31 




. . . _ y . .. 


_; 




■ • . - ■ } 


9337 


32 


+ 




+ 






9342b 


33 








i 




9343c 


34 


+ 


- : .. 








9350e 


35 


■ ■ -4 






; " | " ' ' 


, : ' * (■- . \ 


9351b 


36 




-*-.;. 






.■*:<: £ 1 ■; ■ 


9361 


37 






— 


i 


■■■"' ; 7 : : 


9368 


38 












9373b 


39 


i 


- 








"9385a v 


40 




- ! 


+ 






9386c ; 


41 








i 




9388d 


42 


+ 




r 


. ... -t . . .. 


^ J5 ■ 


9390 


43 


4." 






\ ■ ■ 




9393 


44 


+ 


; -J 




■ -i- ■ 




9396 


45 








i : 




944b I 


46 




i ■ 




.i : • -- 




951 ; 


41 








V 


.* >"V 


953 - 


48 








... . ^ . . . 




954a 


49 


■ H" 










968 


50 












971 


51 












983c 














985 


53 












990 


54 






+ 

998 


55 












316 


56 


+ 


j 




+ 




126c 


57 








• 


s 


207-4 


58 




+ 








265-3 


59 












29B 


60 J 




3- ..... ". 








305B-25 


61 


+ 










326B-39 


62 


+ 










34B-1I 


6 3 j 













+ indicates differential expression as identified in differential display

- indicates absence in differential display

For transcript number 316, reverse transcription PCR (RT-PCR) was
used to detect expression in the breast cancer cell lines.



Table 2. Differentially Expressed Metastatic Marker Polynucleotides

Columns: TRANSCRIPT NUMBER | protein | SEQ ID NO: | non-metastatic breast MCF-7 | breast cancer metastatic to bone and/or lung MDA-MB-231 | breast cancer metastatic to lung MDA-MB-435


902 


osteopontin 


64 ^ 






. "?>+;*" 


9112 


nip 


65 




+ 




9132 


Ca-dependent
protease


66 








9158 


IGF-R 


67 


+ 






9174 


ILGF-BPS 


68 


9177 


lactate • ; 
dehydrogenase 


69 




... + .i 

' • . :- 




9202; 


ufoTKR 


70 




\..+ ; 




9210 


elF2 


71 


- 


S. 




9212; 


; glutaminyl • 
cyclase 


72 








9213 


gp130


73 




' " ' " •" 


;j 


9222i 


TGFb-II 


74 


? .. i 






9232; 


E-cadherin ^ 


75 


i " ' + 




— t4l 


9239 


serpin 


76 








9250 


secreted pS2 


77 


+ 






9260 


LIV-1 


78 


+ 






9315 


GTP-binding
protein


79 


+ 


- 


- 


9317 


vimentin 


80. i 


, .... . ... t „ . . : 






938 


cytochrome C 
oxidase 


81 


+ 








Hsp90 


\"<82 v "- 




■ '.v*, ... ••'•:-< 


+ 


9394 


integrina6 .><*■; 








. i 




desmoplakin


i C 




) ":u; 




970 


osteogenic 
protein 


"' 85 


+ 







+ indicates differential expression as identified in differential display

- indicates absence in differential display
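
The marker logic stated above for E-cadherin, pS2, LIV-1 and the GTP-binding protein (expressed in MCF-7 cells, absent in the two metastatic breast cancer cell lines) can be sketched as follows. This is only an illustrative sketch: the function names and the dictionary-based input format are assumptions introduced here, and the four markers are the ones the description names explicitly, not a complete encoding of Table 2.

```python
# Markers the description reports as expressed in non-metastatic MCF-7 cells
# but not in the metastatic breast cancer cell lines (name -> SEQ ID NO).
NON_METASTATIC_MARKERS = {
    "E-cadherin": 75,
    "pS2": 77,
    "LIV-1": 78,
    "GTP-binding protein": 79,
}


def flag_absent_markers(expression):
    """Return the listed markers that were measured and found absent.

    `expression` is a hypothetical input format: marker name -> bool
    (True = expression product detected). Unmeasured markers are skipped.
    """
    return sorted(
        name for name in NON_METASTATIC_MARKERS
        if expression.get(name) is False
    )


def has_metastatic_potential(expression):
    """Categorize a sample: absence of any listed marker flags the sample."""
    return bool(flag_absent_markers(expression))
```

A sample in which all four markers are detected is not flagged, while a sample lacking, for example, E-cadherin is flagged; how many markers to require in practice is a design choice the description leaves open.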

Within the scope of the invention are variants of the proteins described
above. A variant is a protein encoded by a polynucleotide wherein the global sequence
identity of the DNA, as compared to the corresponding SEQ ID NO: herein, is at least
65% as determined by the Smith-Waterman homology search algorithm as implemented
in the MPSRCH program, using an affine gap search with the following
search parameters: gap open penalty of 12, and gap extension penalty of 1. The protein
encoded by the DNA having the sequence identity described above will exhibit the
percent activity described in the preceding paragraph.
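
The Smith-Waterman search with a gap open penalty of 12 and a gap extension penalty of 1 can be illustrated with a minimal scoring sketch. It is not the MPSRCH implementation: the match and mismatch scores (+5, -4) are assumptions chosen for illustration, the function name is hypothetical, and the convention that the first gap position costs the open penalty and each further position costs the extension penalty is likewise an assumption.

```python
def smith_waterman_affine(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Best local alignment score of sequences a and b with affine gaps.

    Gotoh-style recurrences: the first position of a gap costs `gap_open`,
    every additional position costs `gap_extend`.
    """
    neg_inf = float("-inf")
    m = len(b)
    h_prev = [0] * (m + 1)          # best scores ending in the previous row
    f_prev = [neg_inf] * (m + 1)    # best scores ending with a gap in b
    best = 0
    for i in range(1, len(a) + 1):
        h_cur = [0] * (m + 1)
        f_cur = [neg_inf] * (m + 1)
        e = neg_inf                 # best score ending with a gap in a
        for j in range(1, m + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_extend)
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            h_cur[j] = max(0, h_prev[j - 1] + sub, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best
```

Under these assumed scores, aligning "AAAAACCCCC" against "AAAAAGGCCCCC" opens one two-base gap (cost 12 + 1) so that all ten bases match, which scores higher than the ungapped alternative. Converting such a score into the percent identity threshold recited above would additionally require a traceback, which this sketch omits.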

Also within the scope of the invention are fusion proteins comprising the
proteins and variants disclosed herein. Proteins preferably used in fusion protein
construction include beta-galactosidase, beta-glucuronidase, green fluorescent protein
(GFP), autofluorescent proteins including blue fluorescent protein (BFP), glutathione-S-
transferase (GST), luciferase, horseradish peroxidase (HRP) and chloramphenicol
acetyltransferase (CAT). Additionally, epitope tags are used in fusion protein
constructions, including Histidine (His) tags, FLAG tags, influenza hemagglutinin (HA)
tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Other fusion constructions can
include maltose binding protein (MBP), S-tag, Lex A DNA binding domain (DBD)
fusions, GAL4 DNA binding domain fusions, and Herpes simplex virus (HSV) VP16
protein fusions.

These fusions can be made by standard procedures in the art of
molecular biology, and many are available as kits from, for example, Promega
Corporation (Madison, WI); Stratagene (La Jolla, CA); Clontech (Mountainview, CA);
Santa Cruz Biotechnology (Santa Cruz, CA); MBL International Corporation (MIC;
Watertown, MA); and Quantum Biotechnologies (Montreal, Canada).

The proteins of the invention, and variants as described herein, can also
be used to detect protein interactions in vivo, using the yeast two-hybrid system, for
example as described in U.S. Patent No. 5,674,739.

In addition to the ribozyme and antisense constructs described above, the
polynucleotides of the invention can be used for inhibiting transcription via triple helix
formation as disclosed in U.S. Patent No. 5,674,739.

Those skilled in the art will recognize, or be able to ascertain, using not
more than routine experimentation, many equivalents to the specific embodiments of
the invention described herein. Such specific embodiments and equivalents are
intended to be encompassed by the following claims.
are incorporated by reference as if set forth fully herein.
CLAIMS

We claim: 

1. An isolated and purified human protein comprising an amino acid
sequence which is at least 85% identical to an amino acid sequence encoded by a nucleotide
sequence selected from the group consisting of SEQ ID NOS:1-63 or the complement thereof.

2. The isolated and purified human protein of claim 1 wherein the amino
acid sequence is at least 95% identical.

3. The isolated and purified human protein of claim 1 wherein the amino
acid sequence is encoded by a sequence selected from the group consisting of SEQ ID
NOS:1-63 or the complement thereof.

4. A fusion protein which comprises a first protein segment and a second
protein segment fused to each other by means of a peptide bond, wherein the first protein
segment consists of at least six contiguous amino acids selected from an amino acid sequence
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NOS:1-63 or
the complement thereof.

5. A preparation of antibodies which specifically bind to a human protein
which comprises an amino acid sequence encoded by a nucleotide sequence selected from the
group consisting of SEQ ID NOS:1-63 or the complement thereof.

6. A method for detecting metastatic tumor cells in a tissue sample,
comprising the step of:

measuring in said tissue sample an expression product of a gene which
comprises a coding sequence selected from the group consisting of SEQ ID NOS:1, 2, 4, 5,
9, 11, 13, 14, 18, 19, 20, 22, 24, 26, 29, 30, 33, 35, 36, 38-41, 45, 48, 52, 55, 57, 58, 60, 63-
66, 69-74, 76, 80, 82, and 83, wherein a tissue sample which expresses the product is
categorized as containing metastatic tumor cells.

7. The method of claim 6 wherein the expression product is protein.

8. The method of claim 7 wherein the protein is measured using an
antibody which specifically binds to the protein.

9. A method for detecting metastatic tumor cells in a tissue sample,
comprising the step of:

measuring in a tissue sample an expression product of a gene which comprises
a sequence selected from the group consisting of SEQ ID NOS:3, 7, 8, 10, 12, 15-17, 21, 23,
25-28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61, 62, 67, 68, 75, 77-79, 81, 84, and 85,
wherein a tissue sample which does not express the product is categorized as metastatic.

10. The method of claim 9 wherein the expression product is protein.

11. The method of claim 10 wherein the protein is measured using an
antibody which specifically binds to the protein.

12. A method for determining metastatic potential in a tissue sample,
comprising the step of:

measuring in a tissue sample an expression product of a gene which comprises
a sequence selected from the group consisting of SEQ ID NOS:1, 2, 4, 5, 9, 11, 13, 14, 18,
19, 20, 22, 24, 26, 29, 30, 33, 35, 36, 38-41, 45, 48, 52, 55, 57, 58, 60, 63-66, 69-74, 76, 80,
82, and 83, wherein a tissue sample which expresses the product is categorized as having
metastatic potential.

13. The method of claim 12 wherein the expression product is protein.

14. The method of claim 13 wherein the protein is measured using an
antibody which specifically binds to the protein.

15. A method for determining metastatic potential in a tissue sample,
comprising the step of:

measuring in a tissue sample an expression product of a gene which comprises
a sequence selected from the group consisting of SEQ ID NOS:3, 7, 8, 10, 12, 15-17, 21, 23,
28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61, 62, 67, 68, 75, 77-79, 81, 84, and 85, wherein
a tissue sample which does not express the product is categorized as having metastatic
potential.

16. The method of claim 15 wherein the expression product is protein.

17. The method of claim 16 wherein the protein is measured using an
antibody which specifically binds to the protein.

18. A method of predicting the propensity for metastatic spread of a breast 
tumor preferentially to bone or lung, comprising the steps of: 

measuring in a breast tumor sample an expression product of a gene which 
comprises a sequence selected from the group consisting of SEQ ID NOS:1, 5, 11, 18, 20, 22,
24, 30, 33, 35, 36, 38, 45, 52, 58, 65, 66, 70, 74, 76, and 80,

wherein a breast tumor sample which expresses the product is categorized as 
having a propensity to metastasize to bone or lung. 

19. A method of predicting propensity for metastatic spread of a breast 
tumor preferentially to lung, comprising the steps of: 

measuring in a breast tumor sample an expression product of a gene which 
comprises a sequence selected from the group consisting of SEQ ID NOS:2, 4, 9, 13, 14, 19, 
26, 29, 39-41, 48, 55, 57, 60, 63, 64, 72, 73, 82, and 83,

wherein a breast tumor sample which expresses the product is categorized as
having a propensity to metastasize to lung.

20. A method of predicting propensity for metastatic spread of a colon
tumor, comprising the steps of:

measuring in a colon tumor sample an expression product of a gene which
comprises the nucleotide sequence shown in SEQ ID NO:56,

wherein a colon tumor sample which expresses the product is categorized as
having a low propensity to metastasize.
SEQUENCE LISTING

<110> Chiron Corporation

<120> METASTATIC BREAST AND COLON CANCER
REGULATED GENES

. <i^iy26pi30,46p'' ... . j ; : ] : ^\^' r \~l .' ..." li. * . . 

<i4o> :pct . JCX. : -. .V'"".. X'}"; • ; ' 

. jii7Q> Fasts^ ~ ' ". 1. 

'^<2ii>/i42 •'"',.'./ -V"*^ " • ; : V " : ./\ .' ^ - : ■- 

. v ;^212>;.)p^a : ; ..Z^. ." - -.v.- ' \:_7.7' 

: f <?2M--^ v^^TV.v' • , . • " 

J :<221> raijsc^eattnre " ; * "/ "; "] . ;• [\ \ . j ^ ' 

" <222> a) . . (142) K 5 -\ ./ • • . ' - 7 . 7 ";\'| 

<223> n » A,T,C or G 

<4oo> 1 : 

cacccaagaa ctaagaaaca aagggagaat gtacttttgt agcttagata agcaatgaat 60 
cagtaaagga ctgatctact tgctccacca cccctccctt aataataaca tttactgtrin 120 
atttcctggg cctaagactc ta 142

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(331)

<223> n = A,T,C or G

<400> 2

^gcgagcaga caacataatt tatttccaga aaacaacaga aatgaacatc . ateat^faata 60

catgaaatcg gctgtgatgt gtgaactgct aagggccaaa tgaac^ttt^ ragagc:agtg 120

ggcacaatgt ttacaatgta tgtgtatgtc actttcggta ccfe^rtg^atg cat^gggacg 180

tgctgaaccc gaaaaaaagt gcctttccat aaggactgca atanagaggg caatttaccc 240

tggtggtaca cggaacctajn attcactcct gccatgcctt gceaatagta anctgcaggg 300
tggaacaaga aatcacttgc tctg^gggga a 331

<210> 3
<211> 1112
<212> DNA
<213> Homo sapien
<220>
<221> misc_feature
<222> (1)...(1112)
<223> n = A,T,C or G

<400> 3

ccnnnnnnnn ntncntnnnn ncnnnnccnn ngnnnnnctn gcccririncng ctnnriCGCcn 60 

nnthctnntn gntnangnnc ngaanccgcn nnnngnnnnn acnatnntnn gncgnnnnnt 120 

tcgttnnnnc ntgnntccnc nnnnnctngt ncnnnnnggn ggngcgcncc nccnancctn 180 

cctcnntgnn ncnnnctnnt nnctnngctg ngtctcncng cncngngcnn nnnggggtct 240 

nccgtnctnc nnnnncnnng ttttangncn gnaanacgcc gcgncgagct tttagccatg 300 

ggggataacc gaaccaaacn tnacactctc agaggatcca cctritgggtg eaagcgaaac 360 

tngancnatc tatactctcg anggtncaag gacattgntg agagaaatgg anncacagcc 420 

cacgttcatt gggtangaga ctccnattaa natttctgtc tccccngatg ggccctagac 480 

ccatgaatcc ctattangat cccntcagcg gqcanacncn gjtggctccnc ctgtaatccc 540 

ccacntcggg aggctgatga gggcgaatcc aaggtcagga aathtatata gacncctggc 600 

taaccggnga accccccctc taaaancaaa aaaaaanncc nncnngtntt tanagggngt €S0 

tntttttcnt cgccncgccc gncncgnccg cttnctngct ccncctgnnc nnncritcect 720 

ncnnenntgn tcancccngc gnnncgcnnc ntnccttnnt gngtctggtc hpncttcnnc 780 

ctctcttncn ccnntgtccn tngctctcag ccnctgcccc nccctnnccn tnngtgnnnc 840 

cnccntnatg nccncncnan aggngcangc nntggcncgc tgncchntgt ntgtcnetcn 900 

acgganantg nactcncnac tnngnnacgc natnnnanct ctgctctcag atgacagcan 960 

cggnntnnnc ngcctetanc nncngnnncn nagccnncga nnnaggnanc cgcgnticant 1020 

cnnntttcnc tctncnntng catntctgat ngccgtgnct ncctcnnt tn ctcnagcricn 1080 

tnnccacctc tcgtttagnc nctnnncnna nn 1112 

<210> 4
<211> 183

<212> DNA

<213> Homo sapien

<400> 4

aaaactatga attccatact tgaggtttcc cagccaattg ctcccttctg cttta^aagt 60
gactaggtac tgagagtaca aacactccca ctttataatg aaggcgtcat gtcacccctt 120
cctttacagg tcctggggtc caggagaccc agaatgaagg tgtcagttgg gcatgaagtg 180
tta 183

<210> 5

<211> 1092

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(1092)

<223> n = A,T,C or G

ttncagacca agaagacttg atnagctgaa acccatt^ch i ctacttggaa ^tgatcngc 60 

aaaagctgcc tcagtcanac accggggata aatctggatt tgggttccgg cgtcaaggtg 120 

aanatnatac ctantaanga acnctgtaca ntgccncaag cangtganga ccncccacga 180 

gtttacatria atacaatnct gaaacnacnc aggctggttt tatatctaca tatttgactt 240 

accactatcn cantaaagtt tngcaccttt cnccgaacga aaanaacccc cchtnntgnn 300 

ttcttttnaa aanaccntng nnccncnttn ccgtcncncc ccnriatahtn ^ rinchrlatccc 360 

cccctctncc nntccntnnn egtaannggc gtngcttntg cngtntntgt cccgttttcc 420 
tccgcttngt cntttiitcta tatnggctnn tnttatnccn ngcccttcgt cncctnnngn 480
1 1 cgt ctg tn' cnt acjjt cctc n tnc t ngagc cccan't tgnt - act t cnngct. r t cnnc tccgc . " 540 
attccnti£tc r cgcncnrianc nennntctca nannatgiinc hntnnctncn nccnatncnc 600 
cctnanagnt telnet agac cntcriacntt ; gtntcccgnn ctcttagngn tet^ctneta - 660 
gtgtritnndt ca tctcctct net tctct ct cdt ttgacne ngnncnctcc atcht riritct ; 720 
gnctttctca ; t'c^chhhring cccctnctcn 'cnriagtntgri gtgenennne ttnnnntcna <a.. rl 780. 
rictngtege^^ aqtnhnnccri : nrigcnghnGg nnngctdttt ctritenntta . 840 

gactnacctt :? ntctgnhnhn tcannctagc hctgt cent c tctnritctgc atcrittanae . 900 
atcttirtfit : cn v cccridtcgcaV nentnetntt hacrictcrica tacgtthecn ^nnct'eagtcd..* - ; 960 
gcagnnnngt tncntncrigt cntctcgc^n ctcnnhtcct ctctnnriacn cncctggtdt . 1 1020 
ncgnctcgct ccnncccatn cntncctcgt tgntcnnnnt ennataegtn tncangccnc 1080 
ntctctccnc tn 1092 

<210> 6

<211> 504

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(504)
<223> n = A,T,C or G

<400> 6

ctgga%cgg9 4 atcattta 60 
ecattggfea^f ca^a^ctgag acctgtgtct cacatcagcc* taggtgaagc etactacaaa : ^ 120 
taatgccaag: ^gagaahagc cagtacacta tatggtttat "actctttatc- cctttattca : : 180 
tagdatgflJtt? tttaaaaatg^ttatattatg caacagatgt- gaggcagcan ctaagctatao v: : 240 
c 1 1 aagaa 1 1 - 1 1 c t c t dacd; 1 1 ccaaacca aag tgt cc t g- a a t aageeag ■ gagac t ta 1 1 ■ * 300 
ct 1 1 tgtgcar dec t ggtgba cat ctgactcj t tgtcctahc cariaaac t et ctgaggccac •.; 360 
tgaaagaaca g t ggcec t a t cga t 1 1 ca 1 1 cc t agg fcet c aaa a a t aena / ;t gtngcctt:g 420 
"taaca t aat t agggabagca 1 cctctat 1 1 c acaat t ataa ^t ct aaggtag gat aagacga . v 480 
cacagcagca : J aHaaaett:ac aajgf ■ ;i - r = - -5.;,.'. 504 

<210> 7

<211> 1132

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(1132)
<223> n = A,T,C or G

<400> 7
. gcgngccccc tngtngnncn ttntneneng ttttctgctn tntttatnng aggnctnggt 60 

nnttnntctt agggnnntng tncggtcnng ttnntgttnc, gagcagaaag tgnatatttc 120: 

atgchgccaa ^cttntttiat tgaaaantcc taattntatt gnccgtntag ; taacatgttt v - 180 

gttchacaah gctaat^ tttcttataa gtngtataaa - 240 

t tat t tnat t t acagaa ac t tgt 1 1 caaaa -cana tgriac t a hn t a 1 1 1 ct nd t c 1 1 1 taa r 3 00 

atanccahac taattttcta tccctngaca tctgttcatg ttctatndag dagccaacac . . 360 

aaa^tccanc tgagagctct : tgattaangt gtncgnatta tctagGtact tcchacgttt ^ 420 

toggngenng aaatgricttt taanancctg gcctcaaaaa anaaaaanan -ccceccgnxih ^ 480 

aTOOSfhnttc cnthtanaaa aahggntcnc tenncchgtn. ngagactgtc tccGtgnntri r .> 540 

rignnnntege thtnatdahg ngccncnahg Gtdriccnt cn ctnnngcatt ngatniintan 600 
cnnnctgaga tgngnntang ctgntnpntn ngtgtcntan gtctcgacgt tgnnpggntn 660

tangnancgn cnntntnnnc ; nnat tgoega , gngnntaagt gtgctct tct cntnaratpt 720 

ntcnnnancn tctnhgajtst tnatacggcc gtgcttnctt atcnntgana ncg^tctnan 780 

nanntncgna tgagnntnfca ctgcricncnt gtgtcatctt tctctctant gt^nctima 840 

nncnngtnat tncgennnac tgntantnag tggtatnnag anntcgnncg cnngngccnn 900 
tttnnctgtn gnnatnagnt ntcanganat tna tncgtgatag apagntnagt ; v 960 

gruiggntctg actgatncgt gtcctagtnn cngtgacatc gnncgttann gtcngcactc 1020 

tagtanannt nagtnngang ntgtanatnn ntctcntgtt t cagtiuiagn cccncgag^ 108 0 

cntcanntnt nantgtctcn tctnngtcgt ,anncntgtcg agtngtnana nn 1132 

<210> 8

<211> 736

<212> DNA
<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(736)

<223> n = A,T,C or G

<400> 8

ntgggcccga cgtcgcatgc tcccggncgn catggnnnnc tggtttggtc anatgtgaat 60 

aacgnagaan tgagaccacn ganaagaacc acantgtnan ggnncttgca cnt^tianga 120 

antnagnaat gcctttttnc ^tgagggentt nggnnntcat nnangggngt gnggi^^tt lftn 
ncacctgtaa t accaccact t ttncnatgcc -actgccngtg. natqaccngn qgtaaggacrt 
t caanaccag - cct t a tnaac nt gggnaaac cntntntcta ctaaaaatne ^fpiaantatc ,. 
tgngcnnngt ngngcgt t ct; tntannnccn gc tgnacnng ,angncngi>gn angnt^tcg 360 

cntgaacntg ncntgt tana gtngcantga gcctaaatca cantgatgta ttnncatctg 420 

ggacgaeacg ancngacgac tcncgtactn ; L aaaaaaaaaa ncccnttnijg ggggg^ttt 480 

tnnnggtatt anntatantt ggagaanttt "gggtcannng aatattntta^.^atga^ajat , x 540 

fiaggaataac tntatntgtg tacattgggt tnnaaanang acantantgig ^ n^cta^actn 600 

ttnggggngg aggggnnatt agggnnttaa ttnggnnnctj tnnaaannqn^t 660 

nanaanantn tttnnanaag ngnantngnt ttaaancctn aangnttnnn tnctnttann 720 

ttnnaannnn anannn 736
<210> 9 

<211> 690

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(690)

<223> n = A,T,C or G

<400> 9

tnnncctggn t^cactcc ctt 60 

cagtattact gttttgctaa gccgcttcat teratgcetac acaatttttt tttaaaaggg 120 

aactttagtt aattaagtga taagggactt . aaatatgaat tariaatggtg cagaaeigaga ; 1B0 

taccttttct ggatatttta aagtttaaag gt cant t tct cttaatctga ttatgtgcac 240 

atatgaaaat ggcacatcat atacatgtaa aatcaggcag tatncaticta ttaattactg 300 

tatttgacaa aggaaactct ^ taaattataa tgtgaaacct ^^ttfeatg^aaccaat^c 360 

tagtgcanca tttcagcata tgcaaaaaaa, aaanncctntv ( tgghgngctg tttacaaagg 420 

aaattgttgg atttcacgat ggtttcagga naanaaggtt ttcntcatcn agggtaaach 480 

tcccggataa ggcntngntt taatntnntt annccnnccn athgntaann gtggaaatta 540 
ancctctgaa naaaanancc cacntnnttn gccttgggct tnantctntt tggcngnanc 600
naaaggnnct tnccaggt ht_ cntgnngggc cngnngaann ataanriaann nggggnrictt 660 
nggaaacctt ncnnriaahan thcccncccc 690

<210> 10

<211> 395

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(395)

<223> n = A,T,C or G

<400> 10

tggtatctga cnnaataaga atgcacccat ttgtgagggg taatatttat ctcangattt 60 

actgtaaata tgtatacaca catacaaaaa cccaggcatt gttaagagaa aatnat^gcc 120 

cagagfottna aattatcaga cagaaccttt aanaataatt atgattaatg tgttaaaatt 180 

ctagtggaaa agataaataa catgctcagg anattttagc anagagatag aaactathtn 240 

ngaagctcaa atgaaaatgc taggaaatga aaagcagtat tggaggtgaa agattccttt 300

ggcaatttat caacanactg gagatggcan aggcataatc agtantattg a^ggcagatt 360 

actatntatt atncaancaa aaaaaaaaac cccct 395 

<210> 11

<211> 331

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(331)
<223> n = A,T,C or G

<400> 11 

aacgaggccn ngaggccaat gaggccaaca agacgatgcc ggagacccca actggggact 60 

cagacccgca acctgctcct aaaaaaatga aaacatctga gtcctcgacn atactagtgg 120 

ntegctacag gagggaacgt gaaaagaaca tctccagagg aactCfgtgaa tgaccacgcc 180 

cgagagaaca gaatcaaccc cgaccaaatg gaggaggagg aattcataga aataacgact 240

gaaagaccta aaaagtagca agaagctaca tccctcaaao ttcggcaatg aaaataaagt 300 

ttgagaagct caaaaaaaaa aanccctttt g 331

<210> 12

<211> 693

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(693)
<223> n = A,T,C or G

<400> 12

tncaacgcgt tgggagctnt nccaaggtgg nctagcrmca ttaatgccct accgtgggaa 60 

tatggntgaa gatcttgact aggggactta tgaacccatg cagccgtgcc caaatcctac 120 

caaactgacc ttactttctt gaagacggaa ttgtagtatg gtcgagctca tgctttttgt 180

agtaggccat ncaaattcga ttgactggct aaaaaagatt gttagtggag gct^gaagaa 240

acattttggc tgatgataga tgaatagagc ttggaacaat c^a^^ai;agc:ag^aa3^ 300 

ctatacctat tcataagaaa aagttagtat gtttaccgaa cattatnaaa gaattatgac 360 

attttcaaag ttttaaaatt ttattttgta gggacggggt ctcattgtgt agcccacnct 420 

ggtctgtttc ttgaggattt actatanact gggctgtatt caaagcattg gggatacagg 480 

catgaatgag cccccattgc ctgaacttac cattcaatct gggcagtgaa ^aanaggga 540 

tgntgggaga nccttacaaa gatgaaatgt cgctaactgg agaaatccct actttcagtc 600

agactgaann ggaacaggta gtnactgtgg gtagccctct ttgggnangg gtngattttc 660 

cacatgtgcc cagttaaggg ccnagaacat taa 693 

<210> 13

<211> 305

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(305)

<223> n = A,T,C or G

<400> 13

ttggtatcng gggatgggng aggggagata gncccgaagc atcccnnatt ctcagtaaac 60 

tccttggnat canannatat cntggccnaa gaaccncnca ccntctntgg gtta^aaata 120 

ccgctntatn gngtatgagg ggatngggcn tacgnnataa tttnctatng ganggtattn 180 

ccgcactant gacnagttct ttctnnggtc catttnnaac nacantnttg acat tgntga 240 

tctgcaannc tgtaaaatag tcttncagtg ggcaatnnnt gcacaactgg gttnggtntc 300 

anaca 305 

<210> 14
<211> 308

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(308)

<223> n = A,T,C or G

<400> 14

agcagacaac ntaatccaag ccatttacca aataantata tgcgatgcac attgaatcct 60 

ggcgctctag atatantgcc ccaaaggaaa gagnacaaag tnttccnccc ntagfct^ctac 120 

natgnctatc cnctatcacc tnctgnttcn naagntttnt aaaaataaat tctcttgtat 180 

ancatccnat atcncaccgg tccaaagcgc aacaatctgc aattcanaan ttcc^acaat 240 

cnatntatgn actttcntag gtccggtgtt ctaanatnta atattctaaq c acttactctc 300 

agatctta 308 

<210> 15

<211> 304

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(304)

<223> n = A,T,C or G

<400> 15

ngtnaaggga tatttattec tgttttaaaa ggatacaacc aaggtaggga agercttegtt 60
attggtgatt attcagaaga cctattttct ttacatatgc tatggaaaca atactgtttt 120
ccgctacaga atacagttta tgattatact tttgtaaatt gectgetttt cecct^tcat 180
ctgetaatte caatttgata ctgttctgtgVt^caaaaat^c^ ^agctgtaafe 240
ggtgcctgtc gagagtccca gctgcttggg ggg6tM 300
304



<210> 16

<211> 703

<212> DNA

<213> Homo sapien

<220>
<221> misc_feature
<222> (1)...(703)
<223> n = A,T,C or G

<400> 16

ccggtngnct aaaaaggacc agectaatgt agaaggtggg tatttgejacq agaggcttta 60

gattattatt ttagatccta catatacttt tatcagtaga atgatttcat tria<ga'tytat 120

aatgaaaaag ggtaatgcaa aaattatgta atagatacca aattagggaa gtttggcaat 180

ttcaatggca tatttttagt caaggnacac agatggcagt gecataagea agtctataaa 240

tateggctge agccatcccc ctcattttaa atgttgccct aataatcaat ^pagrttaaca 300

agtatattgg ctgtgtgtca tgaaatagtt catgttcaga tggaaatfgtt a'ggtt;aVtgt 360

atggtttatg gagattaatg aaaatgaatg cccaaaaaaa aaahriccntt trigrtggriggg 420

tttnnnangn acngggctgg attcaaanca ttggggatnc angnttnaat ^ignccccat 480

ttgrictnaac ttaccttnna nnntgggcnn tnnatngaan angggatnnt ^ gfcjganlfiaacc 540

1 1 t nnangnt nnaan tgt nn net t act ggn ^' ;^^adiuuicc nlf 1 1 n rmriirit: hnruin 600

ngnaangcjgg naannhnnnn ntnajicttiniE 'ggg^^^pn '-nt^nt'^g^' iari^ggg^^tit^ 660

rinttnnnipen tnirnggccnn nnnnggiggcri rina^antttt: tgn 703

<210> 17

<211> 171

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(171)
<223> n = A,T,C or G

<400> 17

tccgcntcta agtaattcat caataacgea tgtccactta atgtgaaaat tggtaccatc 60

taatanaatc ttcaacatgg cnatccacmc tattccaata atgaaatgea alatttccctg 120 

ccttctttac tanggtcatt tntagattct tgaggaatga gttctactct t 171 



<210> 18 

<211> 689

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(689)
<223> n = A,T,C or G

<400> 18

aritnngcttfi ggtactaagc agraatc^ctt: hcttgggaac tcfcatgtadcrthg^ 60
tgtgatt^aia atagcateag taaaii^^ 120

accnggctat %-ctgacttt g£gct£c£ca ^aeacfcdtle fc^iriafccaaa ^ 180

cct^gannac ctcanggggt canatgtttg aaggagctgc tgagtatcct ggcaggcanc 240

anagccttac catcagtttg ctgcatggaa ggctgtgtgc ctctatttcc ctgctatttg 300 

ttgaactccc ttgagctccg gtccttccta agtgagagag atgatcccaa tagcriccaac 360 

ctgagagggc tggggagatg ttngaaggaa agcttggctg gggagctgaa Wctggcqrtgt 420 

ggtacatgct tggtaactgg tggccaggan acccgggngt gtgtnctggg abtgtGnqac 480 

tctgctgacc agggtattga aagtccccnc tcaaanacac agaatntntc t^accaag^g 540 

tangtatgan atgacntgtg gagcactttg nataaactgg ttetcatngg nggtcccctt 600 

gaanaggtgc tnnatctgtt caaaaatacg tggctgagct ntanacccng natcctctgt 660 

cagagacatg ggcaggggga ctcaatgct 689

<210> 19
<211> 721

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(721)

<223> n = A,T,C or G

<400> 19

tatanatifcj^ • ■ 60 

tcagc<^cbt ;tacbttctac ti^gtacct^ 120 

-tganOT^^'-^^ cc caa"g^' gac^aanccri acatgc^ ; 180 

tgctcattca acagtcttgc attcagtaig^ l:^ttgacat cacctactat ^gncaggc± 240 

ctatgctang nactggggat acaggagaga ntnaagcgta aagtctttgg tctcaaggaa 300 

tttgcattct agaaagtcta agatgtaata aatgtactgt gggacatgtt .aaatMjfctge 360 

tataacgaaa tataaagggt ttgggagcaa aaaanaaacc cnnttgtggg gntctritncc 420 

nctctgatga agcttactta cttttaacct tnccttctcc tttaaaggtg tttcctpgtt 480 

cccctttcct ttacagattg gttattggtc ttgctgagga gtaggaCtac aattnccigc 540 

attctnctgg aagccaaagc tgtgctacaa ttgnnccaaa gaagatngta atcttaagcg 600 

cccntaatgg taaaatngta ttaaaangtg gacctttgac aaataaattg nttcg^fcttc 660 

ngaattccgg gttngnagct tngngntncc aaaaaccctt nggggriitccc ttttgggcac 720 



721 



<210> 20 

<211> 248

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(248)

<223> n = A,T,C or G
<400> 20 

cttaaacacc ccncccatct ncnccccaga atgagntaan catactcntc nntactgfhat 60 
ctccgtatcc gtccctacnc nggnttgtga ggtgtcatta gcrigatatta ctcctcatcn 120 
ncatcntgan cannatcccc catcnnccat atgntgatna nnacaaacca tnctattncg 180
ccgnngaagc crintchnttc attggattcn tagaecgcan angtcctnat tcngacacrig 240
aatcg^tar 248

<210> 21

<211> 298

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(298)

<223> n = A,T,C or G

<400> 21

ggtctaaggg atgtgatgng agcatagaat ttanctntat ggncatanta gggacatntg 60 

ctgatntacri tggnctgcgg tcnntgaaag gtggngnatg atgactgatg tcatnagtag 120

tacnanggac tncgnnanct gggatcnggg nttacnttgt tcatrtgtnagnagtgnnanch 180 

aagtanatgn taggnataaa gatgttncgg gagatgggtc tacaaantct tttnaagatg 240 

ntcatcttga anannatcaa gtgtgnttgg tataatgact atcattatac aatgteaa 298 



<210> 22 

<211> 591

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(591)

<223> n = A,T,C or G

<400> 22

tcgctagaht' actattcggc- cgcaacxjggg agcctgaitga^ i ggacgcttattgatatgagga;^^^^.;. 60 
aagcactttc cagggatact gagaagaaat ccatcatacc attaccteat Gctgtgaggc 120
ctgaagacat tgaataaccc tgggcagtgg ttcttaggca gatactctag atgctttatg 180
gacaatatta ttttcattgg atgattctgg agctctatta ggagaaaagt aatcatttta 240
ggtcttaaag acttcaagaa aatacaggtt atcaatttat tttaaatctc attgtttcca 300
gttagcaata tcatacctat taaagctgtt cattgtaaca aaattcaatc aaaaaggcag 360
ctaggtcaga aggaaacata ccactctcat ggttcatagt attcactgta tgtatgctag 420
ggaaaagact tgctccagtc tcctcctcag ttctgtgcct gagaaccact gctgcatata 480 
tttgttttta aattttgtat tgaactgtta attgaagctt taaaagcata tatgaaatgt 540 
ataaatctaa gatgtataat acattattga ccccaaaaaa aaaaacccct .b . ; . 591 

<210> 23

<211> 755

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(755)

<223> n = A,T,C or G

<400> 23

gnnnnrihgt t nrinnagcngg^ t tnggtncng actcccnt 1 1 at natgaggg acactgaggc , . 60 
ttcaagagat taggagactt: gttcaaag^ aggcaggratt 120

taaacctggg tttcactgca ttt^ccatta ctggctttta gccatgatgc tctactgtgt 180 

aaccctctta attcttgacc tgtggctata aagtatgtat tgagagacag gccctccctgj 240 

agataacttt ccagccttga caaaggcaca cccttggttc attccttgga gtgtaggacc 300 

tagattgtga caagcccaga tgagtgtgtc tggcagaggg gagcagatct gaggccacca 360 

tatgtgttca cctagcccta aggagtgcca gcttcgctgg tatttgtaca .gctziccatca 420 

ggaetgctca ttggccacgt tctttcctct ccctgccacg ttgatt^ata ctcacataaa 480 

ttaatgctca cattagtgtt caagtatgca aatgagtgct taaaatcatc aotcacacaa 540

tgaccagact gaggatataa cacacaagag cccctctcct ggtaacccca caatcatgca 600 

gatgtgttga cttctctgca ttaccagtct ggtaggcagg gggatatgae J ^gttagaaac 660 

agtctttcan acagcagttc tcaacaccag gtcccttgct gcacaatcga atcacctggg 720 

ggtttaaaaa aatatcatgc cagtcagcca cnntt 755

<210> 24

<211> 513

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(513)
<223> n = A,T,C or G

<400> 24

ctttctaccc aacaagcata gaatatacat tgtatacatc agaaacacgg gacattctcc 60 

aaaatagacc atatgatagg gcacaaaaca agtctcagta aatttaagaa aatcagaatt 120 

atatcaagta ctctctcaga ccacagtgga ataaaattgg aaattaattc cgaaaggaac 180 

actcaaaagc atgcaaatac atggtaatta aataacctac tcctgaatga tt;gttgggtc 240 

nacaatgata tcaagaggga aatttaaaaa ttctttgaac tgaacgataa tagtgacaca 300 

gcctatcaaa aactctggga tacagcaaaa gtggaggtaa gaagaaaatt catageattra 360 

aatgcctata tcaaaaatct gaaagagcac aaataaacaa tctaaggtca ccctcncaga 420 

attggagaaa ctagaacagt ecaaatccaa acccngoaga agaaaagaaa taaccaaatc 480
cgaacaaaac taaatgaatk gaaaaaaatc ccc 513

<210> 25

<211> 574

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(574)

<223> n = A,T,C or G

<400> 25

cgatccaaga gattagaanc ccntggagtg gagcatgctt cnctanaatni ^^ccacctgatn 60 

cttggctnaa riacantnngc tctantttgc tttgtgcccg tccacacaan ctaaaaacaa 120 

gggatggggg gaccncnagt gtctaatatn cntaatatcc ntccncnggc aaatga^atac 180 

tttttacaca cttgtanntt ntggagggan ggggtnatna tgaggggaan gggaaaggat 240 

gaggagaaat ccaggatnan angtctcttc gtcctctcna gactncctca cactctntgt 300 

ggtnaccngg gttcgttntg tceaatggca gacat tat ac tccat ante t acccnggctt 360 

nntcgggttg ggacgecann actcccccna gtngtnnccc ccnancagcn atacacaagt 420 

ntgaacgggt tttgtggcca ntcatcgcaa tgaccttntc ctcnactcna agaaaantaa 480 

acccGttccc cengattggt ttctaaatet ttcaccccat ctaaaataga aagcnc1:nag 540 

tgggangggt tnatcccccc nttaccntta aaac 574 
<210> 26

<211> 185

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(185)
<223> n = A,T,C or G

<400> 26

gnacnattgg caatgacnga aagaatttga angatgnaca agtnaaagnn acagtggcaa 60

agaatcttcn gggcgcgtca aaacaattgg gtgnattaag gacaanetcg gtbaricagta 120

taanctctct ttcncgngga ttahtngnca taatcatnat tctgacnngt aggacattnc 180

caacc 185

<210> 27

<211> 270

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(270)

<223> n = A,T,C or G

<400> 27

ttctggggct ctatacaggc tcctatttng atccangegt gctgatgagt gcacagcacg 60

atcacatctg gaaaccacca ntaccaccac cactacgcac ntcaccaaaa etgtganagg 120 

gggcatttca gagacaanaa ttgaaaancg aatagtcntc acggg^gnat gcariaczattg 180 

accatgacca ggcgctggct caggcagnta aagaggccan agatcaacac cctgacatgt 240 

cngtgaccag agtggtggtc cttacanaga 270

<210> 28

<211> 758

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(758)

<223> n = A,T,C or G

<400> 28

tgctaggtan aaagttacct otaagggaag ctctgda^aa i ^aatca^tg Saata^ 60 

aaagccgc^a ttacaatcaa gaggaaccta cttccctcct ggcaaagaaa cccaaggaag 120 

gcgagcggaa gatttacttg gcaattgaaa gtgccaatga actggctgtg cfagaaiagcaa 180 

aggcagaaat caccaggctc ataaaagaag agctgatccg gctgcaaaat; tcataccaac 240 

caacaaataa aggaagatac aaagtcttat agacatccgg aaaaaagatt ,4 tttacctgt:g 300 

ctggtctatg atgtatgtgg cagttgctgt ctgcagttta caatgtdttg tnaatgaaga 360 

ttttttaaat tctatcttgc tgattttttt taaatataan aaactggtac ttggtaaaga 420 

aatctgtccg taattncccc ccaatcagtc caactatatt taaagccacc tgttttcnaa 480 

ttttgatntc ctttaatgtt nactccaata tccatatttt aaatgtcccg gataatatcc 540 

caaaggttta aaaaatggaa atntttgaac ttcnrtttgaa nanaataaat tcccatcctt 600 






tangggntnt ccccttnccc gttcttccaa gaaatgtgac cttccccaaa aaagntnatc 660 
cctanctttt tgnttccccc etgantttct gancccggac antnacgggt ttaaaanttt 720 
ttaaattttc caanncaaaa aaccntntnn ttttttna 758 

<210> 29

<211> 577
<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(577)

<223> n = A,T,C or G

<400> 29

ctgctaggta ntaanattat ggatccacat tgtnctgagg anacgaanat acttgctgct 60 

gatntfaggtg aaaacgatat tgatccntct ggggttttac ggtgtgcact gggtgctgca 120 

cnnacttgtc aaggtttgnt acgtcctctg ggcatctgca aaaggccctg ^ctctf^gag 180 

tgttgtatgt agtgtaccaa aanagtattt atacatccca ccaatcaaaa q|fca^ctt:tn 240 

ttacctcatg cgaactcatn caaaccaata gaathtcaac atgttct^a cqtt^na^tg 300 

ctcacttact acctctgaac natactcacg ctgtnntttg tctctthctt atcttitttgc 360 

ntcttgtaat taactctttg tttcccttca tcaaatgtaa tgtanatcgt gatctattaa 420 

aanaaaaatc anggttgcac ttgctacttt naanaaaccg antgtggaaa catt^ggtct 480 

naattcacac aggatcngta naactgttgt ggatactgag aaaOTtttga atgtc^ctcc 540 

ccttattacc atcccgcaaa aaaacccctn tnntttt 577

<210> 30 

<211> 449

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(449)

<223> n = A,T,C or G

<400> 30

tttacccaat aanntatagg cgatagaatt gatacctggc gcaatagata tagtaccgca 60 

aggganagat gaaaaattat aacnaagcat aatatagcaa ggactaaccc ctatn^cttn 120 

tgcataatga attaactaga aataactttg caaggagagc caaagctaan . ^accnc^raaa 180 

ccagacgagc tacctangaa cagctaaaag agcacacccg tctafcgtagc anaat^tgg 240 

gaagatttat aggtagaggc gacaaaccta ccgagcctgg tgatagptgg ttgtcp^aga 300 

tagaatctta gttcaacttt aaatttgccc acariaaccct ataaatcccc ttgtaaattt 360 

aactgttagt ccaaagagga acagctcttt ggacactagg aaaaaacctt gtagagagag 420 

tcataaaaaa aanccctntn gggnnnngn 449

<210> 31

<211> 500

<212> DNA
<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(500)

<223> n = A,T,C or G

<400> 31

tcntggaccc nggftccccnn gngancaaan aagaagggcn ngnttncatn gaaaancctg 60 

tgattntcgc cccggtncag gtgttnannt atggcecncn cncatctggt atacgcchaa 120 

acaatntant tttacaatnri gtnccccane aaacaangtt cgtnghrittn actaggtagt 180 

taatcccncc ccatgttcaa ataaagggcc cgcgntncna ataaggaahc cnccccgaint 240 

gcjggtccccg aggccctctc cttcataaaa rincattcaac ttccctcccn ctannaaagn 300 

aattnttcna atttttnaaa cactccctgt ccanggggac tttnecccca ntanctgaaa 360 

aaatngcritg ac^tGc^ct tcggcctaag g^chcaactt an€thhcccc caanaccegn 420 

gggnnaggnin naaact cccc t ngaagggaa^ cnac tcgcht aaiaaarigg^^ V taa tcncccc .480 

cnaattatte c 500

<210> 32

<211> 426

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(426)

<223> n = A,T,C or G

<400> 32 

gtctatgatc acatctgacg ctattcctat ccccttcctc cccgggacct ttcccccttc 60 

ctccctggga ccttttcccc ttcctgttta anaanccagg gccgcctgga ggaagctttg 120 

tcagatctag tggaatgtga cctccctgga atatgtgccc aggggtttgt ctaagcagfct 180 

tcaggctatg gcctttactc catctggtcc ccatccctct tatctctctc atgtgtggct 240 

gcacctggac gcttggacca tagctgtcac agccccctgg ggaggaaccc actccttggo 300 

catntcagcc tgtgcaatgc aaggctcttg tttgatctgt gtgctgacan aaagcccagc 360

ttccttaaga acttttcatg tggaacactt tggttttgan aagaaaataa ateanaaacc 420 
attaaa 426



<210> 33 
<211> 375 

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(375)
<223> n = A,T,C or G

<400> 33

ngttgcacct attggccngc tggtctcgac tcctgacctc gttatctgcc tgectcggcc 60 

tcctaaagtg ctgggattac aggagtgagc cacagtgcct ggcctgtcaa gacttctctt 120 

aagttaaett cctgagaagt gatgtctaaa agtatctttg ctiggtgtgag aactccagtt 180 

tccaacacat attatttccc tcaactattt ggaatatttt agaattttaa t ttccaaagga 240 

ttagtttgaa tacaagtatg ccacataact cagttttcgc catcttncat ttcttaacag 300

tgtaaattaa aagctaataa tcataataat aaagtgcatt taattatctt cgaaaaaaaa 360 

aaancccttt tgggg 375 

<210> 34

<211> 809

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(809)

<223> n = A,T,C or G

<400> 34

ttgcacatgc tggccaggat : ggtctcgatc tcetgacctc gtgatctgcc <^rtcggcc : 60 
tcccaaagtg ctggaactac aggtgtgagc caccacgcct g^cagct tt g £^tct tt: tc ; 120 
tttctgtgat cttgccttag atcacacaga taaaacatga caggacctgg A^rtt 

agtttggctc tcaatcctgt tctcataacc acmactgcct tcatttatct gtgtcatcct 240 

cagacctgac acatagtagg tgctcagtca gtgttcacta agtaaatgat gaccaagaac 300 

tctttgactg ggtccaaggt gcttatccca atacttcgcc atggctacct ccctcattcc 360 

tcagctgact tgctctctct agcctggctg ctcctatttt atttcctaaa qatggacpca 420 

tggcaataag tttaaancta acangttgat acggtaccca tccataattt aatiiaattnt 480 

ggggctcatg caaccncaaa aaccagaacc caaaactacc tgtncaicaaa caacaatcat 540 

tttnggtngg gatcccntnc tngcttggnc ccttttttta aaatgtccat tccccccgga 600

ctttaagaaa ttgaaggaat ncccggaaan tattgttanc gggccccqtt iiagngaaaaa 660 

ggtggcnctc cnnncggggg ccctccctgt ccctgaaatt tnaaaacpcc qctcccnntt 720 

taaanccctt aatcccggnt aacancnaaa naaaattcta gggpccaaac fcannggttt 780 

ggttttaaaa aaccntntat ttttttnat 809 

<210> 35

<211> 192

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(192)
<223> n = A,T,C or G

<400> 35

caccttattg ggatacagca gtgaattaag ctattaaaat aagataatga ^tgcttttat 60 

accttcagta gagaaaagtc tttgcatata aagtaatgtt taaaaaacat ^ g^attgiaaca 120 

cgacattgta tgaagcacaa taaagattct gaagccaaaa aaaaaaaccc caanggggnt 180 

nnttttnaaa aa 192

<210> 36

<211> 368

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(368)

<223> n = A,T,C or G

<400> 36

ctgctagtac caantattat ttaagantac ttttcactac tcctaaataa tgacacagat 60 

acgtttgtct tacacatttc actttattgt caagttatta gtatgtttat tttcaaaagt 120 

tattttttgc aatttctttt tattattccg tactttttaa atttacttca ttatcacgtc 180

ttcctttatt ctttttaaat agtttttgct tttgttattt tgttttccct.^.ttttt^ctct 240 

tggtttgtaa tacctctttc cttatttgct cctttctcat ttgatctcaa^ ^t^taatcca 300 

actgttttcc acatctgatt cactaaaatt ttagcccaaa aaaaaaancc cnttthgggg 360 
gngntttt 368



<210> 37
<211> 219

<212> DNA
<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(219)

<223> n = A,T,C or G

<400> 37

ggccccattt cactctccat ahtggcnctt nctngaacag gcgtnctgga tnagtgcaea 60
tacndtccca tcnacritgca cctatancric ttccactacg caeatcacca aanctgtgaa 120 
aggg^6rith r tcntta'gaba cacaattgca' gaatngacnn cncaricccgg gggannctcp rV : ;180 
aiigttca^CT^tgifo^ gc£nt£ata^- ■ • - " V i v ^ ;^ 1 219 

<210> 38

<211> 198

<212> DNA
<213> Homo sapien

<220>
<221> misc_feature

<222> (1)...(198)
<223> n = A,T,C or G

<400> 38

tcgatacagg gncagatctg ggagccaggg cgttgctgat ga^ttgcaca gacgatcaca 60 
tctgaaacca ccagtaccac caccactacg caeatcacca ? a agcgcfcggc tcnggcaatt 120 
aangaggeca aagagcanca ccctgacatg tcngtgaccn ttgtantggt ccntaangac 180

acngacatcg cctccaca 198

<210> 39

<211> 560

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(560)

<223> n = A,T,C or G

<400> 39

tttnnatcng nacagctagt cctntaaant aatgacttca tagaaatggc attataattt 60

ttaagttgat actctacagg tagctattga tataattagt tttaataaaa catgetgeaa 120

ccatggtata caacaaaaat acatttcttt ggtgattgaa attaaggccg tatttacaat 180

gacttaatat aagactgact tttatcctgc ttcataactt gtatcj^agaa ctcaccaaga 240

aagaattcaa tactgtgaaa tatgeagcaa gaagattggt ctttacctag gctgttgfcttc 300

ctaagctctg agttttcagc accagtagat ttgtattaaa agaaaaaaaa atggggcctt 360

agcttctggc ttttaatttt gecagctaag gacataaaac aaaantaanc aancaaaanc 420

aaat^gecat ^ntgctatcdg^catcattatg taaaagaaaa tntattttag cccctaaaat 480

taggaa^aat gtaatctcag aataaaggtt ^tcatttaag^ ttgaataaat atntagcttt 540

cgaaaaaaaa aanccccttt 560
<210> 40

<211> 421
<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(421)

<223> n = A,T,C or G

<400> 40

atacagggca gcgtgttagg tgaccacacc aggagcctca gcctc^gtpc ttctcagcqg 60

tcgggataag atccaggcat gncttttaaa tctcagaggt agcagtaaac t tttcantf>t 120 

tgcngt tagc aagtgtgtgfc ttgccaataa anccccatta tactaatgtgr cctanttaat 180 

gttcagggaa natctgcttc cactgtgtnc Wggggtgn gfcgagnagcc 240 

ccncrinctgg agggatgaat gctgngttaa ctacngctat cacggatngt gtgntgtgaa - 300 

naatacatcn acatnaatnt tanntgctct gnaanttcoc ttnttatntg tcaagtaact 360 

ntttgtaaaa ntnntnctcc caanttatta cngtgattac taatnnattn gtnccatgtt 420 

t 421 

<210> 41 
<211> 411 

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(411)

<223> n = A,T,C or G

<400> 41

aggtagaggt tgtgcatgtt gtccttttta tctgatctgt gattaaagca gtaatatttt 60 

aagatggact gggaaaaaca tcaactcctg aagttagaaa taagaatggt ttgtaaaatc 120 

cacagctata tcctgatgct ggatggtatt aatcttgtgt agtcttcaac tggttagtgt 180 

gaaatagttc tgccacctct gaegcaccac tgccaatgct gtacgtactg, catttgcccc 240 

ttgagccagg tggatgttta ccgtgtgtta tataacttcc tggctcqt tc ; actgaaca tg 300 

cctantccaa cattttttcc cagtggagtc ncatcctggg atccagtgta taaatcccaa 360 
ttatcatgtc ttgtgcataa attcttccca aaagggatct ntaatttttt g 411

<210> 42

<211> 408
<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(408)

<223> n = A,T,C or G

<400> 42

ggctcccctc cctaactctc taagtactt e ; cjpt tacecac tcagtgtggt : gatggcacct 60
ccctgaatct cetgacaaat gcgaaqagga actCGtattc atcaggagcc aacttgataa 120
ctganaagat tcctctetca tttatcagcc tttgattatc tttttgtgtc tcttactatt 180
tgcgcttagc gagaaaaata aagaggtttg aacaattaag aagtaacaaa gagctcatag 240
ttcacaaaga gcaantcaaa ggatgtctgg aatatttgaa catacaactg cctttggcat 300
gaggtggcct acatacattc tcaggggcag gataggctgg nanagctgat caagctgccg 360
ggaaagctga agcaaaggca gggttggntg gaaatcaaaa tntctctt 408



<210> 43 
<211> 275 
<212> DNA 
<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(275)

<223> n = A,T,C or G

<400> 43

tccctaactc .tctaagrtact tcccttaccc i actcagtgtg 60 

ctcctgacaa .atgcgaacag. gaactcctat: tcatcagagc caacttgata actgagaa^ga \ 120 

ttcct'Gtptc. atttatcagc ctcttactat: ttgcgcttag ' 180 

caagaaaaat , aaagaggttt gaacaantaa gaagtancnn g^agctchta . gt: tcanaagn 24 0 

agcaagtcaa aggatgtctg gangatttga agggt 275



<210> 44

<211> 246

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(246)
<223> n = A,T,C or G

<400> 44

tttggtccca agcacatttc acaaangaga atttacacct agcacagctg gtgccangan 60
atntcctang gacatggcca cctgggtcca ctccagcgac agacccctga caagagcagg 120
tctctggagg ctnantngca tggggcctan tntcntcaat cnaatgagcc ccnahtgcta 180
ctgcgceccg ggggctccca cggcctgggc nnctttcntg caactgnaaa aggatagngg 240
tatttc 246




<210> 45 
<211> 345 
<212> DNA 
<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(345)
<223> n = A,T,C or G

<400> 45

tttggctccg tgggacgttg tantgtgcnc agacatttcc aagggaaatt ctaaacagtc 60
accctnccct tttgcattcc cccaaatctt aagtgtatac ataaaaccct gggtacatat 120
tgtngtggta atagaaggga attggnnaaa cngtacactt gttatatgga antnactgtg 180
gccacctaca aaagacaagt taacaaactg tcntggaggc tgtngntgcc canccagggc 240
cgctgcnttt tgacaacatt cccaccctgg ccactcagca canttcatgg caggtcatgt 300
ctntncactg anacntttnt ganacttttt catatagcan aatcc 345
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<210> 46 

<211> 969 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . ..<96?) 
<223> n = A,T,C or G 



<400> 47 . 
ggccactaag caggtcttac cnaatttaag aanattgaan tcctatcaag tatctcttct 
gaccacaatg gtatgaaact agaaatcagt aacaggagga aaattggaag attcacaaat 
ntgtggaant taatcaacnc atgagcaact antgagtcna agancanatc aaaagrggann 
tcaaaaactc tcttgaggtg gatgagaatg ganatacaac ataccbgaac tcatg^gatg 
tatcacaagc ngtgctaagg gggaagttta agtnctagat gtctanatt a ngaaagggaa 
agatctcana tanacnaccc agent tnene ctcgaanaac tagaaaaact aagaaaaaac 



<210> 48 

<211> 364 

<212> DNA 

<213> Homo sapien 



60 
120 



240 
300 
360 
420 



<400> 46 

aattgcagtt etttettgee tttaacaaca ttagggcett tagaatgagt acctggtgct 
gtccttccaa ctctgtgatt ctctgattcc atcctcattt ttcaceatea ctggtgtact 
ggcaagaacc antatgagat ttgaggaaaa atacttggat tactcttttt taaaaaaaat 180 
tatttagajta taattcccat accatacaa* t aacctt ttt atgtgtataa ttcagftattt nAn 
ntagt^altc cacaaa^tt^ t^tacca'tfe accac t at cc gat tccagag c t t g tcatca 
taca^Waaa aaaaccccah agtnant&c; tttcaaaacri ctttnrtgttri ttcntthtnc 
ccntgti^cn tetagnheng i^^tnrict tttgtcnhth tcnccctnch ctcatcnthh 
cnggtctctg ctcngngnnn cgntntgnct 1 tnriantcget gctnntcntg tat tccccgc 480 
nctngtnnng tetgennegt agccagtggn cctcctgntn cenncngntt etntntnegg 540 
cacanntcca nccanctgcc atnagtnana nnatctctnt tenncanctg ritnneaghnt 600 
tgtcntcntc tccgtnccnc engengctnn ctcnttncgc nctggnngnc ahtegtaect 660 
ggcttttatc cccctntccn nctnttctng atggnntctc ntctcnacac ctgnegttac 720 
gnntctcntn tnnennnann cgttnctntn tnncttnccg nchgccatct riagctcannc 780 
tggngegant cncgctctgn gtatcagtca tntanagann ngngnntgtt nccnncgcgn 840 
nntgagannc ccncccnctt cgcatnacgt angtgncttt ntnnatctgc tcgtcgtctc 900 
nctcatatcc necatgetgn catganactc cntantctnn cgcnnttctn ricgttccctc 960 

969 

tgcccttnn _ 

<210> 47 
<211> 361 

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(361)
<223> n = A,T,C or G



60 
120 
180 
240 
300 
360 
361 



<220> 
<221> misc_feature
<222> (1)...(364)
<223> n = A,T,C or G

<400> 48 

atgatgacca catntagatg gcacatngat gaggacttta atctttcctt aaanacaata 60 

atgtgttctt ttttctttta ntcacatgat ttctaagtan attttncatg qaggacactit 120 

ttttfaacctt gatgtacant gactgtgtaa aatttntctt tcagtggcaa cctctataat 180 

ctttannata tggtgagcat ctngtctgtt tagaanggga tatgacaata aatctateag 240 

atggaaaatc ctgttacaaa gtataaaagc tttagtaatt tactcagtgt ggtggtttta 300 

tcctttttgc tttttctccc ttggtctata atgaaattgt tacagcagtg caaaataaaa 360 

tcct 364 

<210> 49

<211> 703

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(703)
<223> n = A,T,C or G

<400> 49 

atggggaatc aaacaatgtt aaaaggctan taatacttat aggttttatg attca'afetta 60 

ctatgtgttt aaaattgttt tttgaaaaaa ttgagttatg tchctaaaac tgagtctrita 120 

cagctcaaaa atgaagaaat aentatctcc gataagcata ttatgtgaat ttcaaCatcn 180 

ctattgagaa aaggaatata aatttgaatg aaaatgaaac tctatctttc tatatcacat 240 

tgcataggtg taggctagtg agtactttga tgtaaattgc tgtatctttt gaggcntcna 300 

tttggcnata tagatcagaa ttttaaatcn gcatactttg tttgccagaa atctatcagg 360 

accacttgta ntnattttgt triaaaVgaat atcnaaehct tggatgttca hcncagtatt c 420 

gattgtt tta naagaaggaa artg^agaaag ggaggagaat ggaaganana aanggaggga 480 

ggaariait;tg0 aaccn'tt^c c a£ritgtgata gcatngF^att tgctnaac^c hctatanoat ! 540 

acecctn^ca' ; tggganaia^c at^caCTCtti n aaacaaggac imgttngatg gntctacnht :. 600 

- ttgacritcag afcnnaarita'a atnaaaaaaa aaanccc - €(50 

cgnnnhahncr .'; ''V ' . -703 

<211> 413

<213> Homo sapien



<221> misc_feature

<222> (1)...(413)

<223> n = A,T,C or G

<400> 50

tcttggctgg ttgagtattc aanaatcagg cacggagaag tggggtggat gcaaac£aac 60 

tgaccactgt ggcaccacca gdagtttcag ttttcatctt gahtigtchag aggaaatatc 120 

taatcttaca actcnttagg ggcctggctc agtggctcat accttgtntt cccancactt 180 

tgggangcog angcnggcnt atcacccgca ngtcaggatt ttgagaccac cctggGcaac 240 

ntggt^aaac cccatctGta ^ btantcaata i caaancttag . ctangcgtgat tggcatgcac 300 

ctctaatccc acttacttgg gahgcttjagg cagcganaat "cacttgtaad tcggaaggca- * 360 

nacgttgcat htgagccaag atcgtgccac tgcactccat cctgggcttt eta 413
<210> 51

<211> 252 

<212> DNA 

<213> Homo sapien 

<220>
<221> misc_feature
<222> (1)...(252)
<223> n = A,T,C or G



<400> 51 

gttacagaca aggnttntag aatatcttat gtttr.atgct ctgtaagttc aaagaagnta 60 

gcagaaaaca taagcatact gaaaagagaa acagaagcta ttttttaaat acctatgtga 120 

aatctctcta tntgaaacaa aaaatacact ggatggatta gacactgcag aaggaaaatt 180 

tggtgaactt gagatcttat aaataaaaat tatccaaaat gaagtgtaga gftgaaaaaaa 240 

aaaancccct at 252

<210> 52 

<211> 875

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(875)

<223> n = A,T,C or G

<400> 52

agaaacgaga atgganattc aaatacgtcn gpcgggctfeg g c^tgtaaccc 60 

haacactttg ggaggnctag gtgggcggat caccngaggt c cra^a^acg ggaacancrc t 120 

ggcaaaaacc cent c t tt an t efcgrigaaaa cncaactcta ctaaaimaac tactctt^ga 180 
tnggcgtngn tgcgcctgcc tgttntccca g^taennttt n^ggctgang tggggataan ^ 240 
- tnctttaaca tgggaagtgg aagttgcact: gatccaat^ , 300 

tgggttangg aatgagaccc cnchcacgga aaggacaata aaaanccccn ni>gghnt tnn 360 

tttttaangg cctcttgntc nttttcttnt antgcncgcc tnegennnen ttgntntgtc 420 

gantennntg ennttnttte ttcnncctcn ancctgcttc tnntenntte geenntnnac 480 

ngcttccccc ntnetctage actthnnttc tntcgntccn nhatctcenn cttntctnnn 540 

ccgctcgcgt nnnccntnan etcgnntcnt nccctttctt cncngcnncn n 1 1 t<^fhcna 600 

gatcgtncgn ctctatctac ttetntcenn gntntanata tnga^nttac attn£gctcn 660 

atnacccatn annncntcta tgtttatann ngtnnnnccn ttcaaennnri crittatgagn 720 

tcttnactca getctnegtt gntnttccna ctanngttgn nentneatgt netgtcncgt 780 

ancnctctnc tchtCTicrigt cntgagacna atetctatnt atngnt&atn cctgcntnct 840 

ganctncacc gngatctegg enntntctte tcaag 875 

<210> 53 
<211> 182 

<212> DNA

<213> Homo sapien

<400> 53

ccagaagaag ggetaeatat ggactcatgt t 60 

ateagttgee aaccatttgt agttcacaaa ^ ttaaaactgg gtttccaggc ctggtgtggt 120 

ggctcacgcc tgtagccccra getattgeae cactget^^ 180 

ga 182 






<210> 54
<211> 329
<212> DNA
<213> Homo sapien

<400> 54



<210> 55 

<211> 312 

<212> DNA 

<213> Homo sapien 

<220>

<221> misc_feature

<222> (1)...(312)

<223> n = A,T,C or G

<400> 55

actcaactcgj tttgagctat aggaatriggc ; catfcc^Tmcrt ggctcahacc tgtaatccda - 60 

gnatttnggg anacctcact a^gatcacnt g^ 120 

acatggngaa accccatctc tantanaaaa .^acagaa / ' 180 

acccgtaatc cca^ctactt gggaggccaa ggra 240 

ggaggttgcg gtnanctgan atcatgcxrat tgctctccag cctcggcdac ^ anatca^gac" : ; 300 

cctatctcaa aa 312

<210> 56

<211> 565

<212> DNA

<213> Homo sapien 




<400> 56 



<210> 57 
<211> 798
<212> DNA 
<213> Homo sapien 



<400> 57 
ggaacaagta gaagggaaga gggaaatgga gagcatcctt atgactttac aaagggtgga 60

aatgaggatg gagggataca gaagtctgca cagctgtaaa ggttttatag atgtctttgc 120 

cttcccttct gaggaaggga agaagtaatg aaagcacatg tgaataaccc cttccatccc 180 

attcacagca tcgcactccc agtccttaag gcaaagggag gcagtgctga asjcattggtg 240 

gtgcagtgta aagagacaag acctgatcat ctgatcacac ttgtgccaac ttgattcata 300 

ttgggcatta ctaacaaccc ctggtcaagg taaataggtt gaaca^teaa taacattatc 360 

cctgcctgca tacatgtgaa caaaagctat agaggacatg caaattctac agtcattcct 420 
catatgcttt ^|acagagtg cagctactgg aatcttccag atttcagt^t tttaaaatea- 
gagctctgaa tacacaaaag gaaagagaaa tggaigcagct gacatatttt aagctcacag 
tgatactcag t^acagga^c acagagctct aatigtccaca ^atgttgta gggtagggtc 

tctcagtaaa tcaagtccct tacctatgtt ctgacactga ggctcttgga gctatgggtt 660

agaaatccag ga^gcaatat gtctttattc taatgaa^tc ctcatcttgc actcagaggc 720 

ccactagttt gcccttctat atattaagta aaaccaaga^ aaattaaaaa aaaaaaagcc 780

ctatagtgag tcgtatta 798 

<210> 58 

<211> 729 

<212> DNA 

<213> Homo sapien 



<210> 59 

<211> 730 

<212> DNA

<213> Homo sapien



480 
540 
600 



<400> 58 

aagaatagac cgagataggg ttgagtgttg ttecagtttg gaacaaoagt ccactattaa 
agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac 
gtgaaccatc accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga 
accctaaagg gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa 
aggaagg^aa gaaagcgaaa ggagcgggcg ctagggcgctT ggcaagtgta gcggtcacgc 
tgcgcgtaac caccacaccc gccgcgct ta atgcgecgct acagggcgcg tccattcgcc 
attcaggctg ^caactgtt gggaagggcg atcggtgcgg gcctcttcgc tat tacgcca 
grtggcgaaa gggg^atgftg ctgcaaggcg attaagttgg gtaacgccag ggttttccca 
gtcac^acgt tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt 540 
gggccctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcggc 
ttgtaatacg actcactata gggctttttt ttttttcggt ttgaggggga atgctggaga 
ttgtaatggg tatggagaca tatcatataa gtaatgctag tcttatcctg tgtgaaattg 
ttatccgct



60 
120 
180 
240 
300 
360 
420 
480 



600 
660 
720 
729 



<400> 59

aagaatagac cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa 
agaacgtgga^ ctccaacgtc ^^ aaag^cgaa aaaccgtcta tcagggcgat ggcccactac - 
gtgaaccatc accctaatca agttttttgg ggtcgaggtg bc^aaa£ca ctaaatcgga 180 
accctaaagg gagcccccga tttagagctt gacggggaaa grcc^gcgaac gtggcgagaa ?40 
aggaagggaa gaaagcgaaa ggagcgggcg ctag^gdgct ggcaagtgta gcggtcacgc 300 
tgcgcgtaac caccacaccc gccgcgctta a£gcgccgct acagggcgcg tccattcgcc; ; 360 
attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tat tacgcca - 
gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttttccca 
gtcacgacgt tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt 
gggccctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcggc 
ttgtaatacg actcactata gggctttttt ttttttcggt ttgaggggga atgctggaga 
ttgtaatggg tatggagaca tatcatataa gtaatgctag tcttatcctg tgtgaaattg 
ttatccgcta 



60 
120 



360 
420 
480 
540 
600 
660 
720 
730 
<210> 60

<211> 623

<212> DNA

<213> Homo sapien



<400> 60

gactccaaga gaagactagg aagta^ccct cgttctccag ggcacccaaa ataccagcct 
ttattgtctg catgatttta ggggatatgg ggagggaaca agtagaaggg aagagggaaa 
tggagagcat ccttatgact ttacaaaggg tggaaatgag gatggaggga tiacagaa^tc 
tgcacagctg taaaggtttt atagatgtct ttgccttccc ttctgaggaa §§fgaagaagt 
aatgaaagca catgtgaata accccttcca tcccattcac agcatcgcac 'tcccagtfect 
taaggcaaag ggaggcagtg ctgaagcatt ggtggtgcag tgtaaagaga caagacctga 
tcatctgatc acacttgtgc caacttgatt catattgggc attactaaca acccctgggc 
aaggtaaata ggttgaacaa tcaataacat tatccctgcc tgcatacatgt^aabiaaaag 
ctatagagga catgcaaatt ctacagtcat tcctcatatg ctttagacag agtgca'gcta 
ctggaatctt ccagatttca gtgctttaaa atcagagctc tgaatacaca aaaaaaaaaa 
gccctatagt gagtcgtatt aca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
623 



<210> 61



<211> 376
<212> DNA
<213> Homo sapien



<400> 61
gcatgctcga gcggccgcca gtgtgatgga tatctgcaga attcggctta gcggataaca 
atttcacaca ggatccatga ctcagctatt aaggctctgg ccttggat cc c^atgagigaa 
tattttacca caggttcagc agaaggtaac ataaaggttc ggagattgac a^gccat^gc 
ctaattcatt catttaaaag tgaacatgct aagcagtcca tatttcgaaa cattgggrgct 
ggagtcatgc agattgacat catccagggc aatcggctct 
acgctgaaaa ccagggtitct gcccaatgct tttaacatcc 
ctataaagat tggggt 



<210> 62 
<211> 539

<213> Homo sapien



tctcctgtgg tgca§atg[gc 
c t aacagaa t * tct tgacat t 



60 
120 
180 
240 
300 
360 
376 



<400> 62

atgactcatt gtttctctgc ctttccgtgt gttacaggtg ggctgatccc cctgcagcca 
gtttcccata agcaactgac ttccaactgg gaatgtctcg ggggataatg g^g^t^gga 
tatggaagta tagagaaaac ataagaaaat actgggtgta tacacctttc' ttt^fctgag 
tatgatgaca atgtgatagt cagtgtggca tctgcgactc cagettgtgc^^ggcatgta 
caccctagct ccagcttccc ctgggagact gtgcatctcc tg§ctccact 7 ^acabt3cct 
tcttctgacc ttccagccta gagatgatga ctctgccagc ctagatgggc tctgggttgt 
ctccctattc ctgtttgctt tgtagatttc ccattatgct gtcaccaact ccccacjccta 
agccctctct attttaaatt ctcaagtgga ttatgttcct gattagfccce t^actfc/atat 
accactctcc tcatgatctc tgattagttt tcctgttagg tt^%gfcagt : . ; Ma^aaiaaaa- 

<210> 63

<211> 304

<212> DNA

<213> Homo sapien



60 
120 
180 
240 
300 
360 
420 
480 
539 



<400> 63

ggcttagcgg ataacaattt cacacaggac gactccaagc tgggaaggaa aattcccttt 



60 



tccaacctgt atcaattttt acaacttttt tcctgaaagc agtttagtcc atactttgca 120



180 



300 
304 



ctgacataet ttttccttct gtgctaaggt aaggtatcca ccctcgatgc aatccacctt 
gtgttttctt agggtggaat gtgatgttca gcagcaaact tgcaacagac tggccttctg 240 
tttgttactt tcaaaaggcc cacatgatac aattagagaa ttcccaccgc acaaaaaaaa ™ ft 
aaag 

<210> 64

<211> 226

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(226)

<223> n = A,T,C or G

<400> 64

atgatgatga ccatgtggac agccaggact ccattgactc gaacgactct gatgatgtng 60 
atgacactga tgattctcac cagtctgatg agtctcacca ttctgatgaa tctgatgaac 120
tggtcactga ttttcccncg gacctgccng caaccgaagt nttcactcca jt^gtccccc 180 
cagtagacac ntntgatggc cgaggtgatg gtgtggttta tggact 226

<210> 65

<211> 225

<212> DNA

<213> Homo sapien
<221> misc_feature

<222> (1)...(225)

<223> n = A,T,C or G

<400> 65

taccaacaga gcttctgaaa cagataccat agcattggag agaaaaacag ctcacdgt ct 60

gaggaagatg atattganag aaggaaagaa ttgaaagcat ct tgaagaaa aacteagatt 120 

ggatntggga ttggtcaagt cggccggata atattccccc caa^gagttc ctctttaaac 180 

acccgaagcg cacggccacc ctcagcatga ggaacacgag cgtca 225 

<210> 66

<211> 240

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(240)

<223> n = A,T,C or G

<400> 66
60



ccagcatggt ggccgtnatg gatagcgacc cacangcaag ctgggctttg ;^^ttcaa ^ 

180 

gaatgaacan ctctataaca tgatcatccg acnctactca gatgaaagtg ggaacatgga 240 



gtacttgtgg aacaacatca aaaggtggca ggccatatac aaacagtacg ^cactgaccg 
atcagggacc atgtgcagta gtgaactccc angtgccttt gag^agcan ggttccacet 180 



<210> 67 
<211> 504
<212> DNA
<213> Homo sapien



<220>
<221> misc_feature

<222> (1)...(504)
<223> n = A,T,C or G

<400> 67

; cacga<jg*ga gatngcatct: gttatatatt; cc^Gngat^ atgtgagtna, pt<gatagaaa< 
. aaatcgcrihc ggngaacacC ^cice^gftn ccggccfeccg §tactaragg r gA rtch^ 
gacttcaccg tntaetacaa ngtaagciicc cttitaagaat Igtcacggagt at^atgggca 
ggatgcctgc ggctccaaca nctggaacni> %gtggac^tg ;'gacctGcc^c: ccaacaagga 
cntggagccc ggcatcttac tacatgggct gaanccctgg actcagtacg ccgtttacnt 
caaggctgtg accctcacca tggtggagaa cgaccatatc cgtggggcca agagtgagat 
cttgtncatt cgcnccantg cttcngttcc ttccnttccc ttggacnttc tttcggcatc 
aaactcctct tctcagttaa tcgtgaagtg gaaccctccc tctctgccca acggcnacct 
gagttactac tttgtgcnct ggca

<210> 68 

<211> 462

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(462)

<223> n = A,T,C or G

<400> 68

tggatggcag OT§9agaaa<j* gaaaagcaaa acactc<:ag*j acctctcccg gatctgtctcv 
ct cc t ct age * cagcagt a tg ; ; gaeagctgga k cccct gaac t > t cc t c t cc'tc [i 1 1 acc tgggc 
agagtgttgt ctctccccaa att tataaaa i actaaaatgci ,atnccattcc : tctgaaagpa , 
. aaacaaattc ataattgagt gatattaaat anagaggttt tcggaagcag atctgtgaat 
atgaaataca tgtgcatatt tcattcccca ggcagacatt ttttagaaat caatacatgc 
cccaatattg gaaagacttg ttcttccacg .gtgactacag tacatgctga,* agcgtgccgt 
ttcagccctc atttaattca atttgtaagt agcgcagcag cctcrtgtggg ggaggatagg 
ctgaaaaaaa aaaancccct tttttngtnt nttttaaaaa aa



60 
120 
180
240 
300 
360 
420 
480 
504 



60 
120 
180 
240 
300 
360 
420 
462 



<210> 69 

<211> 357 

<212> DNA 

<213> Homo sapien 



<400> 69

agaagtcttc ctgagccttc /catgtatGCt c^gt 

ccaaagctaa 1 aggatijatga g^tgctrcag /ctcaagaaaa ■ ^tggagatac cct^tsggac ; 
atccagaagg acctaaaa^a • cctgtgacta; ^gagctcta 

tacaat^tat ^taactcgatc crtttagtttt ^ .catccatgta catggatcac la^ttgcjttt 
gatctticttc aattgtgaat \ttgggctcac : agaatcaaag icctatgcttg^ ^ gtttaafcgrt.; 
tgcaatctga gctcttgaac aaataaaatt-aactattgta gtgtgaaaaaVaaasiaaa. 



60 
120 
180 
240 
300 
357 



<210> 70 
<211> 226 
<212> DNA

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1)...(226)

<223> n = A,T,C or G 



<400> 70
atgatgatga ccatgtggac agccaggact ; ccattgajcrtc gaacgactct gatgatgtijg 
atgacactga tgattctcac cagtetgat^tagtct tctgatgaac 
tggtcactga .ttttcccncg gaectgcoig caaccgaagt rit tcactqca gt f gtccbcc 
cagtagacac ntntgatggc cgaggtgatg gtgtggttta tggact

<210> 71

<211> 477

<212> DNA

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1)...(477)
<223> n = A,T,C or G



60 
120 
180 
226 



<400> 71 

agcagacaag ccacaattaa catagggtac aattgggtca tgtagctcat gggaaatcca 
cagtcgtcaa agctatttct ggagttcata ctgtcaggtt caaaaatgaa ctagaaagaa 
atattacaat caagcttgga tatgctaatg ctaagattta taagcttgat gacccaagtt 
gccctcggcc agaatgttat agatcttgtg ggagcagtac acctgacgag tttcctacgg 
acattccagg gaccaaaggg aacttcagat tagtcagaca tgtttccttt gttgaetgtc 
" ctggecacna tattt tgatg get actatgc tgaacggtgc ageagtgatg gatgeagetC: 
ttctgttgat agctggtaat gaatcttgee ctcagcctca gaeateggaa ^ acace^gigct 
gctatagaag at catgaaac tggaagecat 7 att ttgaatt ctacaaaata aaattga • ^ 



<210> 72
<211> 374
<212> DNA
<213> Homo sapien 



60 
120 
180 
240 
300 
360 
420 
477 



<220> 

<221> misc_feature 
<222> (1)...(374) 
<223> n = A,T,C or G



<400> 72
ccaagccaga ttgt^etcc: agctgatctt ;ctttgatggt gaagaggctt i ttct^cactg 
gtctcctcaa gat fcetctet i- at gggt ctcjg acact t aact gea^agat^gg; cat^acccc r 
gcacGcaect ggagegagag gcaccagccacactgcatggc atgg^tt t at tggtct tatt; 
ggatttgatt ggagctccaa acccaacgtt tcccaatttt; ^ttcccuiact c^ccaggtg 
gttcgaanga cttcaagcan ttgaacatga. acttcatgaa ttgggttfcgQ;;.^^ai^%t^ 
ct ct t tggag gggegg tat t t ccanaa 1 1 a; cagttatgga; ggtgtgattc aggatgaccn. 
ttttccattt ccaa 



60 
120 
180 
240 
300 
360 
374 



<210> 73 
<211> 597 
<212> DNA

<213> Homo sapien 

<220>

<221> misc_feature

<222> (1)...(597)
<223> n = A,T,C or G

<400> 73

ccaagggatc tgtaaagaat atatacttga gtggtgtgtg ttatcagata aagcactctg 60 
tatcacagac tggcaacaag aagatggtac cgtgcatcgc acctatttaa gagggaactt 120 
agcagagagc aaatgctatt tgataacagt tactccagfca t;atgctgatg gaccagfgaag 180 
ccctgaatcc ataaag^cat acbttaaaca agc^ tccaaag^c^btact^tcg 240 

gacaaaaaaa gtagggaaaa acgaagctg^ 300 
tcanaa'fcg^ tttateagaa v 360 
tgctgtgaat gt^ 420 
cattgtadat >gig^c^^^ ggtccaaaat " r 480 

tcacttttac taccccalia^ 540 
tgcttancat tcctattgac aactcttctg ggaatgctgt* tttgctfetaa' fcaag<^a -^^^ ^ - 597 

<210> 74 " 

<211> 257

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(257)

<223> n = A,T,C or G

<400> 74 

" tggt aaaggg * taat agccacf * agtinfc agaac cttgangaga ' tgcggccaan §at cctt¥a t V 60 

atctgaacdi* -agktgtnaaa 120 

taattthbae & 180 

gatgtgtt^ic ttgat'^g^ 240 

attcct^bt^tgegdcc^^ ' ■: , :-'-v-<. , v^ v - -js- j- ; C; . My- >>. . : .• . / 257 

<210> 75 
<211> 330 

<212> DNA
<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(330)

<223> n = A,T,C or G

<400> 75

tgttcataagf g<Aggtg4ta ; m ,60 

ctgtgtatgg aggtt^na^rC cacaatacgc OTacgari^ 120 
catcncttac accatcctca gccaagatcc tgagctccct ^acnaiaata- t^tcnccat 180 

taacaggaac gcaggagtca tcggtgtggt cnccactggg ctggaccgaa agagtttccc 240 

tacgtgtacc ntggtggttc aagcngctga ccttcanggt gaggggttaa fccacnaeagc 300 

ancngctgtg atcacagtca ctgntaccaa 330
<210> 76

<211> 387 

<212> DNA 

<213> Homo sapien 

<220>

<221> misc_feature
<222> (1)...(387)

<223> n = A,T,C or G

gctcgcgcgc ctgcaggtcg acactagtgg^at^caaagaa, ttcggcacga i^^^ilij/'iv.vy-W 
tatctccaag atgctattcg ttgaacccat ' cctgg^ggtt, tccagcttgcr ^acaaic^ -1 ^; 120 
ctcaacaacc .aajttcagcca ccaaaat^acV agcta^^ .J.- 180 

accc^ccaca g^ aa ^ aa ^^ i^^^f^^; 240 

cccaaca^at tc^cctaccc agcccac^a^r tgcccaggac ^ ^^t^qfe 300 

ctgctctgac ttggai^ntc attcaacara a^c 360 

ctccctfiaag ; -ctctaccacg :5 cctt , ; ... 387 

<210> 77

<211> 339

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(339)

<223> n = A,T,C or G



60 
120 
180 
240 
300 



<400> 77
c^gctgatcn gggtcccttt ggagcacaga^ gccancnngg gacaachacg, 

tgatetgcgc ccjtggtcctg gt^cc^t^ 

canagac^tgrtncag^ggcq ccccgtgaaa gacagaat t<g tggt tt:tcqt ggtg^qapac^ 
cctcccantg .tgcaaataag ggctgrt^^ 

gcttctatcc taataccntc riachtcccnc canaaaagga ntgtgaatfct . tanacac^tP 
tgcagggatc tgcctgcatc ctgacgcngt gccgtcccc 339 

<210> 78

<211> 385

<212> DNA

<213> Homo sapien

<400> 78

tcggtcatag ggagagattt gtatgctgta ctatgcagcg tttaaa^fcta. gtgg^tttg 60 
tgatttttgt attgaatatt gctgtctgtt acaaagtcag-Xt^aa^t^c^tt^^tet 120 
ttaagttatt ctatcttgga gataaaatct gtatgtgcaa ttcaccggfta ttaccagttt 180 
attatgtaaa caagagattt ggcatgacat gttctgtatg tttcagggaa aaat:gt£ttt 
aatgcttttt;,caagaactaa c^ea^tatt^cc1;atactgg. at 1 1 t:agg^g jc^qaag^c n 
tgct<3gtg£t ) ^;taggaat§agj aatgt gcatg aagcctaaa^ i apca^gaaa; g^t^tactg 
aatttaagca aaaaaaaaaa acccc

<210> 79

<211> 307

<212> DNA 
<213> Homo sapien 



240 
300 
360 
385 
<220>

<221> misc_feature
<222> (1)...(307)
<223> n = A,T,C or G

t eg a t acag<j. : ga tg£ Cagag / ct gccagaga ct 1 1 at cc tg ' ' aag^t;t acc aaga t cagaa 6 0 

t cct gacaaa gnagaaag t c" * a t ct act etc act t cacat;g £ get acaga t aciagacaat a s 12 0 

ttcgctttgt gtttgetget gtcaaagaca caattctaca^^ 180 

acct tgt eta aaagctgctg cccactcc tc ccct ataaca gaagatgtga 1 1 1 tgcaaact 1 240 

ccttgtttta tttgnaagtg cttctgacat cnccagagcc agccccatgc caggaactaa 300 

ggatgtc 307 

<210> 80 

<211> 528

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (1)...(528)

<223> n = A,T,C or G

<400> 80

gtcgatacag gaacagcatg tccaaatcga t^ 60 
tgccctgcgt gaegtaegtc agcaatatga aagtgtggct gccaagaacc tgcagga<jgc " 120 

agaagaatgg tacaaatcca agtttgctga cctctctgag gctgccaacc ggaacaatga 180 

cgccctgcgc caggcaaagc aggagtccac tgagtacegg agacaggtgc a^tccctcac 240 

ctgtgaagtg gatgecctta aaggaaccaa tgagtccctg gaacgecaga tgcgttgaaa 300 

tggaagagaa etttgeegtt gaagctgeta actaccaaga cactattggc tgectgeagg 360 

atgagattca gaatatgaag ganggaaatg gctcgtcacc ttegt^aata Ccaagacctg 420 

ctcaatgtta agatggcect tgacattgaa attgccacct acanggaact getggangen 480 

aagaaaacca ggatttctct gcctcctccn aacttttcct .cccctgraa 528 

<210> 81

<211> 369

<212> DNA

<213> Homo sapien

<400> 81

agcatggctc^ 6^aagfcttt*' gepa^aa c&tf pggatg^gt^ ^9!ccttct<gfgcf ' ca^g'c^fCctg 60 

cgaaatcata;:t ; 120 

cgtgtggctg ^^^^ gc&gatttcV ^ *' 180 

aaagat 1 1 tg aggagaigiag ^pa^tyfet^gt at ett t:caga gt gt a^a^ta a t cttggaat M 240 

ataaagaatt tcttca^ ' 300 

tatgacacat gaatatgtgg gctaagaaat agttcctctt gataaataaa 1" daattiaaeaa 360 

aaaaaaaaa 369 

<210> 82
<211> 269
<212> DNA

<213> Homo sapien
<221> misc_feature
<222> (1)...(269)
<223> n = A,T,C or G

<400> 82 

atgacaggga tgancaaact tngtctgggg tattgatgaa gatgacctac tgctgatgat 60 

accagtgctg ctgtaactga agaaatgcca ccccttgaag gagatgacga cacatcacgc 120 

atggaagaag tagactaatc t&ggctrgag ggatgactta cctgttcagft actctacaat 180 

tcctctgata atatattt^ ttctttattt: ttgtt 240 

gtntggiiatg ataactnctt taaggggaa 269

<210> 83

<211> 196

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(196)
<223> n = A,T,C or G

<400> 83 

tttgggtcca attacagcta aagcaaaagt ggttattgaa ctgtttttat cggtctcggg 60 

rmttgctaaa ccttcccagg tgtattttgg aggtacagtt gttggcnagc aagctatnaa 120 

atctgaagat gaagtgggaa gttnaatana gtatgaatnc agggtaagaa ac triaggtaa 180 

acetcnaata tncctc 196

<210> 84

<211> 448

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature
<222> (1)...(448)

<223> n = A,T,C or G

<400> 84

caaacatggg catggtgtca gcgataatgt ttntancagc tcccgacata aiatcagtaan 60 
tnngatttcc accatatcna ncntcnggaa tttaaccntc aggagnagct cttrmtcaga 120 
cnccctggaa aaacgagccc cattgnancc anctjttgana cataaaacct ggagaaattc 
tccaatacng aaggtataria g^ ^cSfggtc^; aag^^tc^g 

gaggctCagg cctgiraaa^ P^99^^9^ 

caggacgcagtctc^^ 360 
cagaaagcct tcata^ctt p^^F^gtg aa^a^sfea agaae^tgte agcagcaga^f *™ 
gcagtgaaaa aaaaaaiaaqc . ■ ^ a* 1 •- '.; r . iitr v.:, s: 



180 
240 
300 



420 
448 



<210> 85 

<211> 169 

<212> DNA 

<213> Homo sapien 



<400> 85 

agcagaccaa ctgccttttg tgagaccttc ccctccctat ccccaacttt aaag^tgtga 
gagtattagg aaacatgagc agcatatggc ttttgatcag tttttcagtg gcagcatcca 



60 
120 




atgaacaaga tcctacaagc tgtgcaggca aaacctagca ggaaaaaaa 169 